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INTRODUCTION 


Sus are a powerful tool for finding patterns in data and inferring 
important connections between events in the real world. We are accus- 
tomed to seeing statistics every day — in the news, for example. However, 
many people find statistics confusing and intimidating. Often statistics are 
considered to make an argument irrefutable. On the other hand, some 
people perceive statistics as a trick, used to support outrageous claims. 


This book is intended to clarify some of the elements of statistical reason- 
ing and analysis. The format is simple, and there are many examples to 
help translate abstract ideas into concrete applications. 


Why You Need This Book 


Can you answer yes to any of these questions? 
M Do you need to review the fundamentals of statistics fast? 
M Do you need a course supplement to statistics? 
M Do you need a concise, comprehensive reference for statistics? 


M Do you need to prepare for a statistics exam? 


If so, then CliffsQuickReview Statistics is for you! 


How to Use This Book 


Youre in charge here. You get to decide how to use this book. You can read 
it cover to cover or just look up the information you need right now. Here 
are some recommended ways to search for topics. 


M Use the Pocket Guide to find essential information, such as basic 
statistics formulas and abbreviations. 

M@ Flip through the book looking for subject areas at the top of each 
page. 

M Geta glimpse of what you'll gain from a chapter by reading through 
the “Chapter Check-In” at the beginning of each chapter. 
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M Use the “Chapter Checkout” at the end of each chapter to gauge your 
grasp of the important information you need to know. 


M@ Test your knowledge more completely in the CQR Review and look 
for additional sources of information in the CQR Resource Center. 


M Use the glossary to find key terms fast. This book defines new terms 
and concepts where they first appear in the chapter. Ifa word is bold- 
faced, you can find a more complete definition in the book’s glossary. 


M@ Or flip through the book until you find what you're looking for—we 
organized this book to gradually build on key concepts. 


Visit Our Web Site 


A great resource, www.cliffsnotes.com, features review materials, valu- 
able Internet links, quizzes, and more to enhance your learning. The site 
also features timely articles and tips, plus downloadable versions of many 


CliffsNotes books. 


When you stop by our site, don’t hesitate to share your thoughts about this 
book or any Hungry Minds product. Just click the Talk to Us button. We 
welcome your feedback! 


Chapter 1 
OVERVIEW 


Chapter Check-In 


Learning about the different kinds of statistics 


Understanding the steps in statistical reasoning 


Learning several uses for statistical reasoning 


FF many people, statistics means numbers—numerical facts, figures, or 
information. Reports of industry production, baseball batting averages, 
government deficits, and so forth, are often called statistics. To be precise, 
these numbers are descriptive statistics because they are numerical data 
that describe phenomena. Descriptive statistics are as simple as the num- 
ber of children in each family along a city block or as complex as the annual 
report released from the U.S. Treasury Department. 


Types of Statistics 


In these pages, we are concerned with two ways of representing descrip- 
tive statistics: numerical and pictorial. 


Numerical statistics 

Numerical statistics are numbers, but clearly, some numbers are more 
meaningful than others. For example, if you are offered a purchase price of 
$1 for an automobile on the condition that you also buy a second auto- 
mobile, the price of the second automobile would be a major consideration 
(its price could be $1,000,000 or only $1,000), and thus, the average—or 
mean—of the two prices would be the important statistic. The different 
meaningful numerical statistics are discussed in Chapter 3. 


Pictorial statistics 
Taking numerical data and presenting it in pictures or graphs is what is 
known as pictorial statistics. Showing data in the form of a graphic can 
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make complex and confusing information appear more simple and 
straightforward. The more commonly used graphics in statistics are dis- 
cussed in Chapter 2. 


Method of Statistical Inference 


Statistics is also a method, a way of working with numbers to answer puz- 
zling questions about both human and non-human phenomena. Questions 
answerable by using the “method” of statistics are many and varied: Which 
of several techniques is best for teaching reading to third-graders? Will a 
new medicine be more effective than the old one? Can you expect it to 
rain tomorrow? What’s the probable outcome of the next presidential elec- 
tion? Which assembly-line process produces fewer faulty carburetors? How 
can a polling organization make an accurate prediction of a national elec- 
tion by questioning only a few thousand voters? And so on. 


For our purposes, statistics is both a collection of numbers and/or pictures 
and a process: the art and science of making accurate guesses about out- 
comes involving numbers. 


So, fundamentally, the goals of statistics are 
mM To describe sets of numbers 


M@ To make accurate inferences about groups based upon incomplete 
information 


Steps in the Process 


Making accurate guesses requires groundwork. The statistician must do 
the following in order to make an educated, better-than-chance hunch: 


1. Gather data (numerical information). 
2. Organize the data (sometimes in a pictorial). 
3. Analyze the data (using tests of significance and so forth). 


This book shows you how to do precisely these procedures and finally how 
to use your analysis to draw an inference—an educated statistical guess— 
to solve a particular problem. While these steps may appear simple (and 
indeed some of them are), sound statistical method requires that you per- 
form them in certain prescribed ways. Those ways are the heart and soul 
of statistics. 
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Making Predictions 


Suppose that you decide to sell commemorative T-shirts at your town’s 
centennial picnic. You know that you can make a tidy profit, but only if 
you can sell most of your supply of shirts because your supplier won't buy 
any of them back. How many shirts can you reasonably plan on selling? 


Your first question, of course, is this: How many people will be attending 
the picnic? Suppose you know that 100,000 tickets have been sold to the 
event. How many T-shirts should you purchase from your supplier to sell 
on the day of the event? 10,000? 50,000? 100,000? 150,000? How many 
of each size—small, medium, large, extra large? And the important ques- 
tion: How many must you sell in order to make some profit for your time 
and effort and vot be left with an inventory of thousands of unsold T-shirts? 


Ideally you need to have an accurate idea—before you buy your inventory 
of T-shirts—just how many ticket holders will want to purchase centennial 
T-shirts and which sizes they will want. But, obviously, you have neither 
the time nor the resources to ask each of those 100,000 people if they plan 
to purchase a commemorative T-shirt. If, however, you could locate a small 
number of those ticket holders—for example, 100—and get an accurate 
count of how many of those 100 would purchase a T-shirt, you'd have a 
better idea of how many of the 100,000 ticket holders would be willing to 
buy one. 


That is, of course, if the 100 ticket holders that you asked (called the sam- 
ple) are not too different in their intentions to purchase a T-shirt from the 
total 100,000 ticket holders (called the population). If the sample is indeed 
representative (typical) of the population, you could expect about the same 
percentage of T-shirt sales (and sizes) for the population as for the sample, 
all things being equal. So if 50 of your sample of 100 people say they can’t 
wait to plunk down $10 for a centennial T-shirt, it would be reasonable 
to expect that you would sell about 50,000 T-shirts to your population of 
100,000. (At just $1 profit per T-shirt, that’s $50,000!) 


But before you start shopping to buy a yacht with your profits, remember 
that this prediction of total T-shirt sales relies heavily upon the sample 
being representative (similar to the population), which may not necessar- 
ily be the case with your sample. You may have inadvertently selected a 
sample that has more expendable income or is a greater proportion of sou- 
venir T-shirt enthusiasts or who knows what else. Are you reasonably cer- 
tain that the intentions of the sample of 100 ticket holders reflect the 
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intentions of the 100,000 ticket holders? If not, you may quite possibly 
be stuck with tens of thousands of centennial T-shirts and no profit to 
splurge on a yacht. 


You can see why choosing a random sample is a critical part of the process of 
statistics and why this book includes a chapter on sampling (Chapter 5). 


Comparing Results 


Making predictions is only one use of statistics. Suppose you have recently 
developed a new headache/pain remedy that you call Ache-Away. Should 
you produce Ache-Away in quantity and make it available to the public? 
That would depend, among other concerns, upon whether Ache-Away is 
more effective than the old remedy. How can you determine that? 


One way might be to administer both remedies to two separate groups of 
people, collect data on the results, and then statistically analyze that data 
to determine if Ache-Away is more effective than the old remedy. And what 
if the results of this test showed Ache-Away to be more effective? How cer- 
tain can you be that shis particular test administration is indicative of all 
tests of these two remedies? Perhaps the group taking the old remedy (the 
control group) and the group taking Ache-Away (the treatment group) 
were so dissimilar that the results were due vor to the pain remedies but to 
the differences between the groups. 


It’s possible that the results of this test are far off the results that you would 
get if you tried the test several more times. You certainly do not want to 
foist a questionable drug upon an unsuspecting public based upon untyp- 
ical test results. How certain can you be that you can put your faith in the 
results of your tests? You can see that the problems of comparing headache 
remedy results can produce headaches of their own. 


This type of statistical procedure is discussed in Chapters 6 and 7. 


Probability 


One of the most familiar uses of statistics is to determine the chance of 
some occurrence. For instance, what are the chances that it will rain tomor- 
row or that the Boston Red Sox will win a World Series? These kinds of 
probabilities, although interesting, are not the variety under discussion 
here. Rather, we are examining the probability in statistics that deals with 
classic theory and frequency theory—events that can be repeated over and 
over again, independently, and under the same conditions. 
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Coin tossing and card drawing are two such examples. A fair coin (one 
that is not weighted or fixed) has an equal chance of landing heads as land- 
ing tails. A typical deck of cards has 52 different cards—13 of each suit 
(hearts, clubs, diamonds, and spades)—and each card or suit has an equal 
chance of being drawn. This kind of event forms the basis of your under- 
standing of probability and enables you to find solutions to everyday prob- 
lems that seem far removed from coin tossing or card drawing. 


Probability is discussed in Chapter 4. 


Common Statistics Mistakes 


Whether you are new to the study of statistics or have been using statisti- 
cal analysis for years, there is a likelihood that you will make (or already 
have made) errors in your application of principles explained in this book. 
In fact, some particular errors are committed so frequently in statistical 
analysis that we feel obliged to call them to your attention in the hope that 
by, being prepared for them, you will avoid their pitfalls. Appendix A lists 
these common mistakes. 


Chapter Checkout 


1. True or False: Statistics are only about things in the past, so you can't 
use them to make predictions. 


2. True or False: An inference is an educated guess, made after analyzing 
data. 


3. True or False: A population is a randomly chosen portion of a larger 
sample. 


4. True or False: Statistical analysis will allow you to be absolutely sure 
of your conclusions. 


Answers: 1. False 2. True 3. False 4. False 


Chapter 2 
GRAPHIC DISPLAYS 


Chapter Check-In 


Displaying data in simple graphical formats such as bar charts, pie 
charts, dot plots, and ogives 


Constructing frequency histograms and distributions from data and 
learning about the basic properties of these displays 


Using stem-and-leaf plots and box plots to display numerical mea- 
sures of data 


O Understanding scatter plots 


Pp charts, bar charts, and histograms are graphic displays of numeri- 
cal data—simple collections of numbers made more understandable 
by use of visuals. This section discusses the more common graphic displays 
used in statistical analysis. 


As an example, consider the yearly expenditures of a college undergradu- 
ate. After collecting her data (expense records) for the past year, she finds 
the expenditures shown in Table 2-1. 


Table 2-1 Yearly Expenses of a College Undergraduate 


Item Amount 
Tuition fees $5,000 
Room and board 9,000 
Books and lab 2,000 
Clothes/cleaning 1,000 
Transportation 2,000 


Insurance and miscellaneous 1,000 
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These figures, although presented in categories, do not allow for easy anal- 
ysis. The reader must expend extra effort in order to compare amounts 
spent or relate individual proportions to the total. For ease of analysis, these 
data can be presented pictorially. 


Bar Chart 


One way to pictorially display the numbers shown in Table 2-1 is with a 
bar chart (see Figure 2-1). 


Figure 2-1 Vertical bar chart presentation of the expenditures of a 
college undergraduate for the past year. 


= 9000 


6000 


3000 
0 — 


Room Tuition Transpor- Books Clothes & Ins. & 
& Board Fees tation &Lab Cleaning Misc. 


Amount of 
Money Spent ($ 


Comparing the size of the bars, you can quickly see that room and board 
expenses are nearly double tuition fees, and tuition fees are more than dou- 
ble books and lab or transportation expenses. 


A bar chart may also be placed on its side with the bars going horizontally, 
as shown in Figure 2-2. 


Figure 2-2 Horizontal bar chart presentation of the expenditures of a 
college undergraduate for the past year. 
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In each bar chart, vertical or horizontal, the amounts are ordered from 
highest to lowest or lowest to highest, making the chart clearer and more 
easily understood. Space is left between each of the bars in order to define 
the categories as being different. 


The bottom line in Figure 2-1 and the left-hand side in Figure 2-2 indi- 
cate 0. Although typical, this presentation need not always be used. Finally, 
while the lengths of the bars may be different, their thicknesses are the same. 


Dot Plot 


Dot plots are similar to bar graphs. Typically used for a small set of val- 
ues, a dot plot uses a dot for each unit of measurement; thus, the preced- 
ing undergraduate expense data represented in a dot plot could look like 
that shown in Figure 2-3. 


Figure 2-3 Dot plot of the expenditures of a college undergraduate for 
the past year. 
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Pie Chart 


The previous charts have a limitation: It’s difficult to see what portion of 
the total each item comprises. If knowing about a “part of the whole” is 
an important consideration, then a pie chart is a better choice for show- 
ing the same data. A pie chart may also display each category’s percentage 
of the total. Using the same data from the undergraduate expenditures, we 
get the pie chart shown in Figure 2-4. 


The parts of the circle (or pie) match in size each category’s percentage of 
the total. The parts of the pie chart are ordered from highest to lowest for 
easier interpretation of the data. Pie charts work best with only a few cat- 
egories; too many categories will make a pie chart confusing. 
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Figure 2-4 Pie chart presentation of the expenditures of a college 
undergraduate for the past year. 
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Transportation 
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Ogive 


Data may be expressed using a single line. An ogive (a cumulative line 
graph) is best used when you want to display the total at any given time. 
The relative slopes from point to point will indicate greater or lesser 
increases; for example, a steeper slope means a greater increase than a more 
gradual slope. An ogive, however, is not the ideal graphic for showing com- 
parisons between categories because it simply combines the values in each 
category and thus indicates an accumulation, a growing or lessening total. 
If you simply want to keep track of a total and your individual values are 
periodically combined, an ogive is an appropriate display. 


For example, if you saved $300 in both January and April and $100 in 
each of February, March, May, and June, an ogive would look like 
Figure 2-5. 


Figure 2-5 Ogive of accumulated savings for one year. 
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An ogive displays a running total. Although each individual month's sav- 
ings could be expressed in a bar chart (as shown in Figure 2-6), you could 
not easily see the amount of total growth or loss, as you can in an ogive. 


Figure 2-6 Vertical bar chart of accumulated savings for one year. 


Savings Each Month 
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& 
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The choice of graphic display, therefore, depends on what information is 
important for your purposes: percentages (parts of the whole), running 
total, comparisons of categories, and so forth. 


Frequency Histogram 


One of the more commonly used pictorials in statistics is the frequency 
histogram, which in some ways is similar to a bar chart and tells how many 
items are in each numerical category. For example, suppose that after a 
garage sale, you want to determine which items were the most popular: 
the high-priced items, the low-priced items, and so forth. Let’s say you sold 
a total of 32 items for the following prices: $1, $2, $2, $2, $5, $5, $5, $5, 
$7, $8, $10, $10, $10, $10, $11, $15, $15, $15, $19, $20, $21, $21, $25, 
$25, $29, $29, $29, $30, $30, $30, $35, $35. 


The items sold ranged in price from $1 to $35. First, divide this range of $1 
to $35 into a number of categories, called class intervals. Typically, no fewer 
than 5 or more than 20 class intervals work best for a frequency histogram. 


Choose the first class interval to include your lowest (smallest value) data 
and make sure that no overlap exists so that one piece of data does not fall 
into two class intervals. For example, you would not have your first class 
interval be $1-$5 and your second class interval $5-$10 because the four 
items that sold for $5 would belong in both the first and the second inter- 
vals. Instead, use $1-$5 for the first interval and $6-$10 for the second. 
Class intervals are mutually exclusive. 
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First, make a table of how your data is distributed. The number of obser- 
vations that falls into each class interval is called the class frequency. See 


Table 2-2. 


Table 2-2 Distribution of Items Sold at Garage Sale 
Class Interval Frequency 
1 $1-$5 
$6-$10 
$11-$15 
$16-S20 
$21-$25 
$26-$30 
$31-$35 


N [OO ]oa!/F | wln 
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Note that each class interval has the same width. That is, $1-$5 has a width 
of five dollars, inclusive; $6-$10 has a width of five dollars, inclusive; 
$11-$15 has a width of five dollars, inclusive, and so forth. From the data, 
a frequency histogram would look like that shown in Figure 2-7: 


Figure 2-7 Frequency histogram of items sold at a garage sale. 
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$1-5 $6-10 $11-15 $16-20 $21-25 $26-30 $31-35 


Unlike in a bar chart, the class intervals are drawn immediately adjacent 
to each other. 


Relative Frequency Histogram 


A relative frequency histogram uses the same information as a frequency 
histogram but compares each class interval to the total number of items. 
For example, the first interval ($1-$5) contains 8 out of the total of 32 items, 
so the relative frequency of the first class interval is 8/32. See Table 2-3. 
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Table 2-3 Distribution of Items Sold at Garage Sale, Including 
Relative Frequencies 


Class Interval Frequency Relative Frequency 
1 $1-$5 8 25 

2 $6-$10 6 .1875 

3 $11-$15 4 125 

4 $16-S20 2 .0625 

5 $21-$25 4 125 

6 $26-$30 6 .1875 

7 $31-$35 2 .0625 


The only difference between a frequency histogram and a relative frequency 
histogram is that the vertical axis uses relative or proportional frequency 
instead of simple frequency (see Figure 2-8). 


Figure 2-8 Relative frequency histogram of items sold at a garage sale. 
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Frequency Polygon 


Relative frequencies of class intervals can also be shown in a frequency 
polygon. In this chart, the frequency of each class is indicated by points 
or dots drawn at the midpoints of each class interval. Those points are then 
connected by straight lines. 


Comparing the frequency polygon (shown in Figure 2-9) to the frequency 
histogram (refer to Figure 2-7), you will see that the major difference is 
that points replace the bars. 


Chapter 2: Graphic Displays 15 


Figure 2-9 Frequency polygon display of items sold at a garage sale. 
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Whether to use bar charts or histograms depends on the data. For example, 
you may have qualitative data—numerical information about categories 
that vary significantly in kind. For instance, gender (male or female), types 
of automobile owned (sedan, sports car, pickup truck, van, and so forth), 
and religious affiliations (Christian, Jewish, Moslem, and so forth) are all 
qualitative data. On the other hand, quantitative data can be measured in 
amounts: age in years, annual salaries, inches of rainfall. Typically, qualita- 
tive data are better displayed in bar charts, quantitative data in histograms. 
As you will see, histograms play an extremely important role in statistics. 


Frequency Distribution 


Frequency distributions are like frequency polygons (refer to Figure 2-9); 
however, instead of straight lines, a frequency distribution uses a smooth 
curve to connect the points and, similar to a graph, is plotted on two axes: 
The horizontal axis from left to right (or x axis) indicates the different pos- 
sible values of some variable (a phenomenon where observations vary 
from trial to trial). The vertical axis from bottom to top (or y axis) mea- 
sures frequency or how many times a particular value occurs. 


For example, in Figure 2-10, the x axis might indicate annual income (the 
values would be in thousands of dollars); the y axis might indicate fre- 
quency (millions of people or percentage of working population). Notice 
that in Figure 2-10 the highest percentage of the working population 
would thus have an annual income in the middle of the dollar values. The 
lowest percentages would be at the extremes of the values: nearly 0 and 
extremely high. 
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Figure 2-10 Asymmetric bell curve. 


Notice that this frequency curve displays perfect symmetry; that is, one half 
(the left side) is the mirror image of the other half (the right side). A bell- 


shaped or mound-shaped curve is also normal, giving it special properties. 


The negatively skewed curve, shown in Figure 2-11, is skewed to the lefi. 
Its greatest frequency occurs at a value near the right of the graph. 


Figure 2-11 Negatively skewed bell curve. 


ray 


The positively skewed curve (see Figure 2-12) is skewed to the right. Its 
greatest frequency occurs at a value near the left of the graph. This distri- 
bution is probably a more accurate representation of the annual income 
of working Americans than is Figure 2-10. 


Figure 2-12 Positively skewed bell curve. 


Figure 2-13 shows a J-shaped curve. 
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Figure 2-13 J-shaped curve. 


Unlike Figure 2-10, a bimodal curve (shown in Figure 2-14) has two high 
points. 


Figure 2-14 A bimodal curve has two maximum peaks. 


Stem-and-Leaf 


Another useful pictorial is the stem-and-leaf. It is similar to a histogram 
in that it shows the range of data, where the data are concentrated, if there 
are any outliers (occasional extremely high or extremely low scores), and 
the general shape of the distribution. 


For example, look at the following data—test scores of 17 high school stu- 
dents: 69, 75, 77, 79, 82, 84, 87, 88, 89, 89, 89, 90, 91, 93, 96, 100, 100. 
The stem-and-leaf takes the first digit of each score as the stem and uses 
the remaining digits as the leaf. 


As an example, for the score of 69, the 6 is the stem and the 9 is the leaf; 
for the next three grades (75, 77, and 79), 7 is the stem, and 5, 7, and 9 


are the leaves. 


Note, too, that along the extreme left side of the chart is a vertical column 
that keeps a running count or total. (Some stem-and-leafs do not include 
this running total.) Having a running total enables the reader to locate 
quickly the median. Median is discussed further in Chapter 3. 


The completed stem-and-leaf for the high school students’ test scores looks 
like that shown in Table 2-4. 


18 CliffsQuickReview Statistics 


Table 2-4 Stem-and-Leaf Display of Students’ Test Scores 


Running Count Stem Leaves 

1 6 9 

4 7 579 

11 8 2478999 
15 9 0136 

17 10 00 


Notice that, like a histogram, each stem determines a class interval and, 
also like a histogram, the class intervals are all equal. (In this case, each 
interval width is from a possible low of 0 to a possible high of 9.) All 17 
scores are displayed in the stem-and-leaf so that you can see not only the 
frequencies and the shape of the distribution but also the actual value of 
every score. 


Box Plot (Box-and-Whiskers) 


Box plots, sometimes called box-and-whiskers, take the stem-and-leaf 
one step further. A box plot will display a number of values of a distribu- 
tion of numbers: 


The median value 
The lower quartile (Q;) 
The upper quartile (Q;) 


The interquartile range or QR (distance between the lower and upper 
quartiles) 


The symmetry of the distribution 


The highest and lowest values 


Use the set of values in Table 2-5 to examine each of the preceding items. 


Table 2-5 Verbal SAT Scores of 20 High School Students. 
280, 340, 440, 490, 520, 540, 560, 560, 580, 580, 


600, 610, 630, 650, 660, 680, 710, 730, 740, 740 
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The median (the middle value in a set that has been ordered lowest to high- 
est) is the value above which half of the remaining values fall and below 
which the other half of the remaining values fall. Because there are an even 
number of scores in our example (20), the median score is the average of 
the two middle scores (10th and 11th)—580 and 600—or 590. 


The lower quartile (Q,or 25th percentile) is the median of the bottom 
half. The bottom half of this set consists of the first 10 numbers (ordered 
from low to high): 280, 340, 440, 490, 520, 540, 560, 560, 580, 580. The 
median of those 10 is the average of the 5th and 6th scores—520 and 
540—or 530. The lower quartile score is 530. 


The upper quartile (Q;or 75th percentile) is the median score of the top 
half. The top half of this set consists of the last 10 numbers: 600, 610, 630, 
650, 660, 680, 710, 730, 740,740. The median of these 10 is again the 
average of the 5th and 6th scores—in this case, 660 and 680—or 670. So 
670 is the upper quartile score for this set of 20 numbers. 


A box plot can now be constructed as follows: The left side of the box 
indicates the lower quartile; the right side of the box indicates the upper 
quartile; and the line inside the box indicates the median. A horizontal 
line is then drawn from the lowest value of the distribution through the 
box to the highest value of the distribution. (This horizontal line is the 
“whiskers.”) 


Using the Verbal SAT scores in Table 2-5, a box plot would look like 
Figure 2-15. 


Figure 2-15 A box plot of SAT scores displays median and quartiles. 


200 300 400 500 600 700 800 


Qy= 530 Median Q3= 670 
590 


Without reading the actual values, you can see by looking at the box plot 
in Figure 2-15 that the scores range from a low of 280 to a high of 740; 
that the lower quartile (Q,) is at 530; the median is at 590; and the upper 
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quartile (Q;) is at 670. Because the median is slightly nearer the lower quar- 
tile than the upper quartile and the interquartile range is situated far to 
the right of the range of values, the distribution departs from symmetry. 


Scatter Plot 


Sometimes you want to display information about the relationship involv- 
ing two different phenomena. For example, suppose you collected data 
about the number of days that law school candidates studied for a state 
bar examination and their resulting scores on the exam. The data from 
eight candidates is shown in Table 2-6. 


Table 2-6 Law School Candidates’ Prep Times and Test Scores 


Candidate # 1 2 3 4 5 6 7 8 
Days studied 7 9 5 1 8 4 3 6 


Score earned 23 25 14 #5 22 15 11 17 


One dot would then be plotted for each examinee, giving a total of only 
8 dots, yet displaying 16 pieces of numerical information. For example, 
candidate #1 studied for 7 days and received a score of 23. Candidate 
#1’s dot would be plotted at a vertical of 23 and a horizontal of 7 (see 
Figure 2-16). 


Figure 2-16 A representative point of data on a scatter plot. 
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A completed scatter plot would look like Figure 2-17. 
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Figure 2-17 A scatter plot displaying the relationship between prepara- 
tion time and test score. 
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There is a strong positive relationship between the number of days stud- 
ied and the score on the bar exam; that is, the data displayed indicates that 
an increase in days studied for the exam correlates with an increase in score 
achieved. A negative relationship would be indicated if the dots suggest 
a line going down from left to right, meaning that as one variable increases, 
the other decreases. And no relationship would be indicated if the scatter 
plot dots suggest a completely horizontal line, a completely vertical line, 
or no line at all (see Figure 2-18). 


Figure 2-18 Scatter plots which display no relationship between the 
variables plotted. 


These relationships are discussed in more detail in Chapter 8, which dis- 
cusses bivariate relationships. 


22 CliffsQuickReview Statistics 


Chapter Checkout 


1. A car salesman takes inventory and finds that he has a total of 125 
cars to sell. Of these, 97 are the 2001 model, 11 are the 2000 model, 
12 are the 1999 model, and 5 are the 1998 model. Which two types 
of charts are most appropriate to display the data? Construct one of 
the plots. 


2. Given the bar graph shown here, take the information and construct 
a scatter plot. 
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a. How many total accidents have occurred at the intersection of 
South and Main Streets in the last year? 

b. How many occurred between June and August? 

c. What interval saw the greatest increase in the rate of accidents? 


Answers: 1. Pie or bar 2. Provide your own answer. 3a. 18 b. 2 c. May-June 


Chapter 3 
NUMERICAL MEASURES 


Chapter Check-In 


Calculating and understanding the central tendencies of data 


4 Using measures of variability to describe the spread in data 


Discussing the principle types of measurement scales 


his section discusses how to graph numerical information—to trans- 

form it into a picture—to get a better sense of the data. Another way 
to get a sense of data is to use numerical measures, certain numbers that 
give special insight into your values. Two types of numerical measures are 
important in statistics: measures of central tendency and measures of 
variation. Each of these individual measures can provide information 
about the entire set of data. 


Measures of Central Tendency 


Measures of central tendency are numbers that tend to cluster around the 
“middle” of a set of values. Three such middle numbers are the mean, the 
median, and the mode. 


For example, suppose your earnings for the past week were the values 


shown in Table 3-1. 


Table 3-1 Earnings for the Past Week 


Day Amount 
Monday $350 
Tuesday $150 


(continued) 
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Table 3-1 (continued) 


Day Amount 
Wednesday $100 
Thursday $350 
Friday $50 
Mean 


You could express your daily earnings from Table 3-1 in a number of ways. 
One way is to use the average, or mean, of each day. The arithmetic mean 
is the sum of the measures in the set divided by the number of measures 
in the set. Totaling all the measures and dividing by the number of mea- 
sures, you get $1,000 + 5 = $200. 


Median 

Another measure of central tendency is the median, which is defined as the 
middle value when the numbers are arranged in increasing or decreasing 
order. When you order the daily earnings shown in Table 3-1, you get $50, 
$100, $150, $350, $350. The middle value is $150, and therefore, $150 is 


the median. 


If there is an even number of items in a set, the median is the average of 
the two middle values. For example, if we had four values—4, 10, 12, 26-— 
the median would be the average of the two middle values, 10 and 12; 
thus, 11 is the median in that case. The median may sometimes be a bet- 
ter indicator of central tendency than the mean, especially when there are 
outliers, or extreme values. 


Example 1: For example, given the four annual salaries of a corporation 
shown in Table 3-2, determine the mean and the median. 


Table 3-2 Four Annual Salaries 


Position Salary 
CEO $1,000,000 
Manager $50,000 
Administrative $30,000 


Clerical $20,000 
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The mean of these four salaries is $275,000. The median is the average of 
the middle two salaries, or $40,000. In this instance, the median appears 
to be a better indicator of central tendency because the CEO’s salary is an 
extreme outlier causing the mean to lie far from the other three salaries. 


Mode 

Another indicator of central tendency is the mode, or the value that occurs 
most often in a set of numbers. In the set of weekly earnings in Table 3-1, 
the mode would be $350 because it appears twice and the other values 
appear only once. 


Notation and formulae 

The mean of a sample is typically denoted as x (read as x bar). The mean 
of a population is typically denoted as Ul (read as mew.) The sum (or total) 
of measures is typically denoted with a X. The formula for a sample 
mean is 


ae oe NRK wt Xe 
a n 


Mean for grouped data 

Occasionally you may have data that do not consist of actual values but 
rather grouped measures. For example, you may know that, in a certain 
working population, 32 percent earn between $25,000 and $29,999, 40 
percent earn between $30,000 and $34,999, 27 percent earn between 
$35,000 and $39,999, and the remaining 1 percent earn between $80,000 
and $85,000. This type of information is similar to that presented in a fre- 
quency table. (Refer to Chapter 2 for information about frequency tables 
and graphic displays.) Although you do not have precise individual mea- 
sures, you can, nevertheless, compute measures for grouped data, data 
presented in a frequency table. 


The formula for a sample mean for grouped data is 
_- ak 
eS ee 


where x denotes the midpoint of the interval; fx denotes the sum of the 
measurements of the interval. 
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For example, if 8 is the midpoint of a class interval and there are 10 mea- 
surements in the interval, fr = 10(8) = 80, the sum of the 10 measurements 
in the interval. 


Xfx denotes the sum of all the measurements in all class intervals. Dividing 
that sum by the number of measurements yields the sample mean for 
grouped data. 


For example, consider the information shown in Table 3-3. 


Table 3-3 Distribution of the Prices of Items Sold at a Garage Sale 


Class Interval Frequency (f) Midpoint 0) fx 
$1-$5 8 3 24 
$6-$10 6 8 48 
$11-$15 4 13 52 
$16-S20 2 18 36 
$21-$25 4 23 92 
$26-$30 6 28 168 
$31-$35 2 33 66 
n= 32 Xf = 486 


Substituting into the formula: 


z= Dit _ 486 . 15.19 


Therefore, the average price of items sold was about $15.19. 


Median for grouped data 

The median for grouped data may not necessarily be computed precisely 
because the actual values of the measurements may not be known. In that 
case, you can find the particular interval that contains the median and then 
approximate the median. 
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Using Table 3-3, you can see that there is a total of 32 measures. The 
median is between the 16th and 17th measure, and therefore the median 
falls within the $11-$15 interval. The formula for the best approximation 
of the median for grouped data is 


median= L+ “-(.5n- ae) 
bn 


where ZL = lower class limit of the interval that contains the median 
n = total number of measurements 
w = class width 
Snea= frequency of the class containing the median 


Xf,= sum of the frequencies for all classes before the median class 


Consider the information in Table 3-4. 


Table 3-4 Distribution of Prices of Items Sold at a Garage Sale 
Class Boundaries Frequency (f) 

$.995-$5.995 
$5.995-$10.995 
$10.995-$15.995 
$15.995-$20.995 
$20.995-$25.995 
$25.995-$30.995 
$30.995-$35.995 


OTO)| Fly] | OD | oo 


n= 32 


As we already know, the median is located in class interval $11-$15. So 
L=11, n= 32, w= 4.99, fred = 4, Lf = 14. 
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Substituting into the formula: 


median= L+ F650 iG) 
4 


= 10.995 + == (.5 (32) - 14) 
= nse ge 14) 
= 10.995 + 4:29 1 (2) 


= 10.995 + Sie = 13.49 


Symmetric distribution 
In a distribution displaying perfect symmetry, the mean, the median, and 
the mode are all at the same point, as shown in Figure 3-1. 


Figure 3-1 For a symmetric distribution, mean, median, and mode are 
equal. 


Skewed curves 

As you have seen, an outlier can significantly alter the mean of a series of 
numbers, whereas the median will remain at the center of the series. In 
such a case, the resulting curve drawn from the values will appear to be 
skewed, tailing off rapidly to the left or right. In the case of negatively 
skewed or positively skewed curves, the median remains in the center of 
these three measures. 


Figure 3-2 shows a negatively skewed curve. 
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Figure 3-2 A negatively skewed distribution, mean < median < mode. 


Figure 3-3 shows a positively skewed curve. 


Figure 3-3 A positively skewed distribution, mode < median < mean. 


Measures of Variability 


Measures of central tendency locate only the center of a distribution of 
measures. Other measures are often needed to describe data. 


For example, consider the two sets of numbers presented in Table 3-5. 


Table 3-5 Earnings of Two Employees 


Earnings of Employee A/Day Earnings of Employee B/Day 


$200 $200 
$210 $20 
$190 $400 
$201 so 
$199 $390 
$195 $10 
$205 $200 


$200 $380 
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The mean, the median, and the mode of each employee's daily earnings 
all equal $200. Yet, there is significant difference between the two sets of 
numbers. For example, the daily earnings of Employee A are much more 
consistent than those of Employee B, which show great variation. This 
example illustrates the need for measures of variation or spread. 


Range 

The most elementary measure of variation is range. Range is defined as 
the difference between the largest and smallest values. Note that the range 
for Employee A is $210— $190 = $20; the range for Employee B is 
$400 — $0 = $400. 


Deviation and variance 

The deviation is defined as the distance of the measurements away from 
the mean. In Table 3-5, Employee A’s earnings have considerably less devi- 
ation than do Employee B’s. The variance is defined as the sum of the 
squared deviations of 7 measurements from their mean divided by (7-1). 


So, from the table of employee earnings (Table 3-5), the mean for 
Employee A is $200, and the deviations from the mean are as follows: 


0, +10, -10, +1, -1, -5, +5, 0 
The squared deviations from the mean are, therefore, the following: 
0, 100, 100, 1, 1, 25, 25, 0 


The sum of these squared deviations from the mean equals 252. Dividing 
by (2-1), or 8 — 1, yields 252/7, which equals 36. The variance = 36. 


For Employee B, the mean is also $200, and the deviations from the mean 
are as follows: 


0, -180, +200, —200, +190, —190, 0, +180 
The squared deviations are, therefore, the following: 


0; 32,400; 40,000; 40,000; 36,100; 36,100; 0; 32,400 


Chapter 3: Numerical Measures 31 


The sum of these squared deviations equals 217,000. Dividing by (7— 1) 
yields 217,000/7, which equals 31,000. 


Although they earned the same totals, there is significant difference in vari- 
ance between the daily earnings of the two employees. 


Standard deviation 

The standard deviation is defined as the positive square root of the vari- 
ance; thus, the standard deviation of Employee A’s daily earnings is the 
positive square root of 36, or 6. The standard deviation of Employee B’s 
daily earnings is the positive square root of 31,000, or about 176. 


Notation 
2 . 
§ denotes the variance of a sample. 


2 : . 
© denotes the variance of a population. 
s denotes the standard deviation of a sample. 


6 denotes the standard deviation of a population. 


Empirical rule: The normal curve 
The practical significance of the standard deviation is that with mound- 
shaped (bell-shaped) distributions, the following rules apply: 


@ The interval from one standard deviation below the mean to one stan- 
dard deviation above the mean contains approximately 68 percent of 
the measurements. 


@ The interval from two standard deviations below the mean to two 
standard deviations above the mean contains approximately 95 per- 
cent of the measurements. 


m@ The interval from three standard deviations below the mean to three 
standard deviations above the mean contains approximately all of the 
measurements. 


These mound-shaped curves are usually called normal distributions or 
normal curves (see Figures 3-4 through 3-6). 
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Figure 3-4 The interval + o from the mean contains 68% of the 
measurements. 


68% 


Figure 3-5 = The interval + 20 from the mean contains 95% of the 
measurements. 


95% 


Figure 3-6 = The interval + 30 from the mean contains 99.7% of the 
measurements. 


99.7% 


Shortcut formulae 
A shortcut method of calculating variance and standard deviation requires 
two quantities: sum of the values and sum of the squares of the values. 


=x = sum of the measures 


2 
Xx* = sum of the squares of the measures 
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For example, using these six measures: 3, 9, 1, 2, 5, and 4: 
Xx=34+94+1424+54+4=24 
Le = 34 P41 4 PH 
=9+81+1+4+25+16= 136 


The quantities are then substituted into the shortcut formula to find 


De-». 


Siew eyes (249) 


Saye CY 
=136- 
21365 A. = 40 


The variance and standard deviation are now found as before: 


> Dye)? 40 | 
oe ge 


s=/9 =,/8 =2.828 
Percentile 
The Nth percentile is defined as the value such that N percent of the val- 
ues lie below it. So, for example, a score of 5 percent from the top score 


would be the 95th percentile because it is above 95 percent of the other 
scores (see Figure 3-7). 


Figure 3-7 95% of the test scores are less than the value of the 95th 
percentile. 


95% of scores 
000111111222223334455566667789991011 11 1112120315 


95th Percentile 


Quartiles and interquartile range 

The lower quartile (Q,) is defined as the 25th percentile; thus, 75 per- 
cent of the measures are above the lower quartile. The middle quartile 
(Q)) is defined as the 50th percentile, which is, in fact, the median of all 
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the measures. The upper quartile (Q,) is defined as the 75th percentile; 


thus, only 25 percent of the measures are above the upper quartile. 


The interquartile range (QR) is the set of measures lying between the 
upper and lower quartiles, or the middle 50 percent of the measures (see 
Figure 3-8). 


Figure 3-8 Aan illustration of the quartiles and the interquartile range. 
Scores on Test 


17 23 28 38 40,41 41 45 47 60,61 61 63 80 80,81 82 85 88 95 


f 


Q,(40.5) Median Q,(60.5) Q,(80.5) 
kk Interquartile Range (IQR) —- | 


Measurement Scales 


Different measurement scales allow for different levels of exactness, depend- 
ing upon the characteristics of the variables being measured. The four types 
of scales available in statistical analysis are 


M@ Nominal: A scale that measures data by name only. For example, 
religious affiliation (measured as Jewish, Christian, Buddhist, and so 
forth), political affiliation (measured as Democratic, Republican, 
Libertarian, and so forth), or style of automobile (measured as sedan, 
sports car, station wagon, van, and so forth). 


M@ Ordinal: Measures by rank order only. Other than rough order, no 
precise measurement is possible. For example, medical condition 
(measured as satisfactory, fair, poor, guarded, serious, and critical); 
social-economic status (measured as lower class, lower-middle class, 
middle class, upper-middle class, upper class); or military officer rank 
(measured as lieutenant, captain, major, lieutenant colonel, colonel, 
general). Such rankings are not absolute but rather “relative” to each 
other: Major is higher than captain, but we cannot measure the exact 
difference in numerical terms. Is the difference between major and 
captain equal to the difference between colonel and general? You 
cannot say. 


M@ Interval: Measures by using equal intervals. Here you can compare 
differences between pairs of values. The Fahrenheit temperature scale, 
measured in degrees, is an interval scale, as is the centigrade scale. The 
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temperature difference between 50 and 60 degrees centigrade (10 
degrees) equals the temperature difference between 80 and 90 degrees 
centigrade (10 degrees). Note that the 0 in each of these scales is arbi- 
trarily placed, which makes the interval scale different from ratio. 


@ Ratio: Similar to an interval scale, a ratio scale includes a 0 mea- 
surement that signifies the point at which the characteristic being 
measured vanishes (absolute 0). For example, income (measured in 
dollars, with 0 equal to no income at all), years of formal education, 
items sold, and so forth, are all ratio scales. 


Chapter Checkout 


Q&A 


1. A doodad machine is capable of making up to 200 doodads per hour. 


a. For four consecutive hours, the doodad machine makes 120, 
118, 124, and 112 doodads. Find the average rate of doodad 


production. 


b. During the fifth hour, the machine breaks after making only 21 
doodads. The remainder of the hour is spent repairing it. 
Normally, the machine must average 115 doodads per hour, over 
the course of an 8 hour day, in order to meet quota. What must 
the average rate be for the next 3 hours to meet this quota? 


2. A meteorologist measures the average rate of rainfall in order to test 
the saying, “When it rains, it pours.” Her data, showing the rainfall 
rate over the course of a rainy morning, are as follows: 


Time of Day 


10:00 - 10:15 a.m. 
10:15 - 10:30 a.m. 
10:30 - 10:45 a.m. 
10:45 - 11:00 a.m. 
11:00 - 11:15 a.m. 
11:15 - 11:30 a.m. 
11:30 - 11:45 a.m. 
11:45 - 12:00 p.m. 


Rainfall Rate (inches/min) 
0.011 
0.007 
0.001 
0.010 
0.003 
0.045 
0.002 
0.006 
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a. Find the median and mean. 
b. Calculate the variance and standard deviation. 


3. Can a distribution have the mean equal to the median, but the mode 
not equal to either mean or median? If so, draw it. 


Answers: la. 118.5 b. 141.67 2a. 0.006, 0.0106 b. 0.0002, 0.014 3. Yes 
(a symmetric but bimodal distribution is a good example). 


Chapter 4 
PROBABILITY 


Chapter Check-In 


4 Applying classic probability theory to simple events 


Learning how to analyze combinations of simple events based on these 
rules 


4 Learning about probability distributions, especially the binomial 
distribution 


Prrbic theory plays a central role in statistics. After all, statistical 
analysis is applied to a collection of data in order to discover 
something about the underlying events. These events may be connected 
to one another—for example, mutually exclusive—but the individual 
choices involved are assumed to be random. Alternatively, we may sample 
a population at random and make inferences about the population as a 
whole from the sample by using statistical analysis. Therefore, a solid 
understanding of probability theory—the study of random events—is nec- 
essary to understand how the statistical analysis works and also to correctly 
interpret the results. 


You have an intuition about probability. As you will see, in some cases, 
probability theory seems obvious. But be careful, for on occasion a seem- 
ingly obvious answer will turn out to be wrong—because sometimes your 
intuition about probability will fail. Even in seemingly simple cases, it is 
best to follow the rules of probability as described in this chapter rather 
than rely on your hunches. 


Classic Theory 


The classic theory of probability underlies much of probability in 
statistics. Briefly, this theory states that the chance of a particular outcome 
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occurring is determined by the ratio of the number of favorable outcomes 
(or “successes”) to the total number of outcomes. Expressed as a formula, 


(A) = number of favorable outcomes 
total number of possible outcomes 


For example, the probability of randomly drawing an ace from a well- 
shuffled deck of cards is equal to the ratio 4/52. Four is the number of 
favorable outcomes (the number of aces in the deck), and 52 is the num- 
ber of total outcomes (the number of cards in the deck). The probability 
of randomly selecting an ace in one draw from a deck of cards is therefore 
4/52, or .077. In statistical analysis, probability is usually expressed as a 
decimal and ranges from a low of 0 (no chance) to a high of 1.0 (certainty). 


The classic theory assumes that all outcomes have equal likelihood of 
occurring. In the example just cited, each card must have an equal chance 
of being chosen—no card is larger than any other or in any way more likely 
to be chosen than any other card. 


The classic theory pertains only to outcomes that are mutually exclusive 
(or disjoint), which means that those outcomes may not occur at the same 
time. For example, one coin flip can result in a head or a tail, but one coin 
flip cannot result in a head and a tail. So the outcome of a head and the 
outcome of a tail are said to be “mutually exclusive” in one coin flip, as is 
the outcome of an ace and a king as the outcome of one card being drawn. 


Relative Frequency Theory 


The relative frequency theory of probability holds that if an experiment 
is repeated an extremely large number of times and a particular outcome 
occurs a percentage of the time, then that particular percentage is close to 
the probability of that outcome. 


For example, if a machine produces 10,000 widgets one at a time, and 
1,000 of those widgets are faulty, the probability of that machine produc- 
ing a faulty widget is approximately 1,000 out of 10,000, or .10. 


Probability of Simple Events 


Example 1: What is the probability of simultaneously flipping three 
coins—a penny, a nickel, and a dime—and having all three land heads? 
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Using the classic theory, determine the ratio of number of favorable out- 
comes to the number of total outcomes. Table 4-1 lists all possible outcomes. 


Table 4-1 Possible Outcomes of Penny, Nickel, and Dime Flipping 
Outcome # Penny Nickel Dime 


1 


OI NID alF | wf]n 

a/a/al;/aj/zi/z2}]/x—/=z 
a/aj/zj/zj/jal/al/xziz 
aj/xzj/al\/xzjal/zja/z 


There are 8 different outcomes, only one of which is favorable (outcome 
#1: all three coins landing heads); therefore, the probability of three coins 
landing heads is 1/8, or .125. 


What is the probability of exactly two of the three coins landing heads? 
Again, there are the 8 total outcomes, but in this case only 3 favorable out- 
comes (outcomes #2, #3, and #5); thus, the probability of exactly two of 
three coins landing heads is 3/8 or .375. 


Independent Events 


Each of the three coins being flipped in the preceding example is what is 
known as an independent event. Independent events are defined as out- 
comes that are not affected by other outcomes. In other words, the flip of 
the penny does not affect the flip of the nickel, and vice versa. 


Dependent Events 


Dependent events, on the other hand, are outcomes that are affected by 
other outcomes. Consider the following example. 


Example 2: What is the probability of randomly drawing an ace from a 
deck of cards and then drawing an ace again from the same deck of cards, 
without returning the first drawn card back to the deck? 
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For the first draw, the probability of a favorable outcome is 4/52, as 
explained earlier; however, once that first card has been drawn, the total 
number of outcomes is no longer 52, but now 51 because a card has been 
removed from the deck. And if that first card drawn resulted in a favor- 
able outcome (an ace), there would now be only 3 aces in the deck. Or the 
number of favorable outcomes would remain at 4 if that first card drawn 
were not an ace. So the second draw is a dependent event because its prob- 
ability changes depending upon what happens on the first draw. 


If, however, you replace that drawn card back into the deck and shuffle 
well again before the second draw, then the probability for a favorable out- 
come for each draw will now be equal (4/52), and these events will be 
independent. 


Probability of Joint Occurrences 


Another way to compute the probability of all three flipped coins landing 
heads is as a series of three different events: first flip the penny, then flip 
the nickel, then flip the dime. Will the probability of landing three heads 
still be .125? 


Multiplication rule 
To compute the probability of two or more independent events all occur- 
ring—joint occurrence—multiply their probabilities. 


For example, the probability of the penny landing heads is 1/2, or .5; the 
probability of the nickel next landing heads is 1/2, or .5; and the proba- 
bility of the dime landing heads is 1/2, or .5; thus, note that 


5X5xX%.5=.125 


which is what you determined with the classic theory by assessing the ratio 
of number of favorable outcomes to number of total outcomes. The nota- 
tion for joint occurrence is P(AB) = P(A) x P(B) and reads: The probabil- 
ity of A and B both happening is equal to the probability of A times the 
probability of B. 


Using the multiplication rule, you can also determine the probability of 
drawing two aces in a row from a deck of cards. The only way to draw two 
aces in a row from a deck of cards is for both draws to be favorable. For 
the first draw, the probability of a favorable outcome is 4/52. But because 
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the first draw is favorable, only 3 aces are left among 51 cards. So the prob- 
ability of a favorable outcome on the second draw is 3/51. For both events 
to happen, you simply multiply those two probabilities together: 


4 _ = 

37 XS ee = .0045 
Note that these probabilities are not independent. If, however, you had 
decided to return the initial card drawn back to the deck before the sec- 
ond draw, then the probability of drawing an ace on each draw is 4/52, as 
these events are now independent. Drawing an ace twice in a row, with 


the odds being 4/52 both times, gives the following: 


4.4 _ 16 _ 
52; 52" 2708 
In either case, you use the multiplication rule because you are computing 


probability for favorable outcomes in all events. 


Addition rule 
Given mutually exclusive events, finding the probability of at least one of 
them occurring is accomplished by adding their probabilities. 


For example, what is the probability of one coin flip resulting in at least 
one head or at least one tail? 


The probability of one coin flip landing heads is .5, and the probability of 
one coin flip landing tails is .5. Are these two outcomes mutually exclusive 
in one coin flip? Yes, they are. You cannot have a coin land both heads and 
tails in one coin flip; therefore, you can determine the probability of at least 
one head or one tail resulting from one flip by adding the two probabilities: 


5 +.5 = 1.0 (or certainty) 


Example 3: What is the probability of at least one spade or one club being 
randomly chosen in one draw from a deck of cards? 


The probability of drawing a spade in one draw is 13/52; the probability 
of drawing a club in one draw is 13/52. These two outcomes are mutually 
exclusive in one draw because you cannot draw both a spade and a club in 
one draw; therefore, you can use the addition rule to determine the prob- 
ability of drawing at least one spade or one club in one draw: 


+13 _ 26 _ 


B57 78 


Sie 
We 
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Non-Mutually Exclusive Outcomes 


For the addition rule to apply, the events must be mutually exclusive. Now 
consider the following example. 


Example 4: What is the probability of the outcome of at least one head in 
two coin flips? 


Should you add the two probabilities as in the preceding examples? In the 
first example, you added the probability of getting a head and the proba- 
bility of getting a tail because those two events were mutually exclusive in 
one flip. In the second example, the probability of getting a spade was 
added to the probability of getting a club because those two outcomes were 
mutually exclusive in one draw. Now when you have two flips, should you 
add the probability of getting a head on the first flip to the probability of 
getting a head on the second flip? Are these two events mutually exclusive? 


Of course, they are not mutually exclusive. You can get an outcome of a 
head on one flip and a head on the second flip. So, because they are not 
mutually exclusive, you cannot use the addition rule. If you did use the 
addition rule, you would get 


or certainty, which is absurd. There is no certainty of getting at least one 
head on two flips. (Try it several times, and see that there is a possibility 
of getting two tails and no heads.) 


Double Counting 


By using the addition rule in a situation that is not mutually exclusive, you 
are in fact double counting. One way of realizing this double counting is 
to use the classic theory of probability: List all the different outcomes when 
flipping a coin twice and assess the ratio of favorable outcomes to total 
outcomes (see Table 4-2). 


Table 4-2 All Possible Outcomes of Flipping the Same Coin Twice 
First Flip with Second Flip 

head + head 

head + tail 
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First Flip with Second Flip 


tail + head 


tail + tail 


There are 4 total outcomes. Three of the outcomes have at least one head; 
therefore, the probability of throwing at least one head in two flips is 3/4 
or .75, not 1.0. But if you had used the addition rule, you would have 
added the two heads from the first flip to the two heads from the second 
flip and gotten 4 heads in 4 flips, 4/4 = 1.0! But the two heads in that first 
pair constitute only one outcome, and so by counting both heads for that 
outcome, you are double counting because this is the joint-occurrence out- 
come that is not mutually exclusive. 


To use the addition rule in a non-mutually exclusive situation, you must 
subtract any events that double count. In this case: 


1 1 I, 2323) ox 

zt 5 ia 75 
The notation, therefore, for at least one favorable occurrence in two 
events is 


P(A + B) = P(A) + PB) — PAB) 


This rule is read: The probability of at least one of the events A or B equals 
the probability of A plus the probability of B minus the probability of their 
joint occurrence. (Note that if they are mutually exclusive, then P(AB)— 
the joint occurrence—equals 0, and you simply add the two probabilities.) 


Example 5: What is the probability of drawing either a spade or an ace 
from a deck of cards? 


The probability of drawing a spade is 13/52; the probability of drawing 
an ace is 4/52. But the probability of their joint occurrence (an ace of 


spades) is 1/52. Thus, 


P(A+ B= P(A) + P(B)- P(AB 


Se ee 
52 +52 52 
-16_ 4 ~ 3977 
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Conditional Probability 


Sometimes you have more information than simply total outcomes and 
favorable outcomes and, hence, are able to make more informed judgments 
regarding probabilities. For example, suppose you know the following 
information: In a particular village, there are 60 women and 40 men. 
Twenty of those women are 70 years of age or older; 5 of the men are 70 
years of age or older. See Table 4-3. 


Table 4-3 Distribution of People in a Particular Village 


70+ years 69 or less Totals 
Women 20 40 60 
Men 5 35 40 
Totals 25 75 100 


What is the probability that a person selected at random in that town will 
be a woman? Because women constitute 60 percent of the total popula- 
tion, the probability is .60. 


What is the probability that a person 70+ years of age selected at random 
will be a woman? This question is different because the probability of A 
(being a woman) given B (the person in question is 70+ years of age) is 
now conditional upon B (being 70+ years of age). Because women num- 
ber 20 out of the 25 people in the 70+ years-old group, the probability of 
this latter question is 20/25, or .80. 


Conditional probability is found using this formula: 


AB _ P(AB) 


which is read: The probability of A given B equals the proportion of the 
total of Aand B to the total of B. The vertical bar in the expression AIB is 
read given that or given. 


Probability Distributions 
A probability distribution is a pictorial display of the probability—P(x)— 


for any value of x. Consider the number of possible outcomes of two coins 
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being flipped (see Table 4-4). Table 4-5 shows the probability distribution 


of the results of flipping two coins. Figure 4-1 displays this information 
graphically. 


Table 4-4 Possible Outcomes of Two Flipped Coins 


H+H = 2 heads 
H+T = 1 head 
T+H = 1 head 
T+T = 0 heads 


Table 4-5 Probability Distribution: Number of Heads 


x 464) 

0 1/4 or .25 
1 1/2 or .50 
2 1/4 or .25 


Figure 4-1 Probability distribution of the results of flipping two coins. 
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Discrete vs. continuous variables 

The number of heads resulting from coin flips can be counted only in inte- 
gers (whole numbers). The number of aces drawn from a deck can be 
counted only in integers. These “countable” numbers are known as dis- 
crete variables: 0, 1, 2, 3, 4, and so forth. Nothing in between two vari- 
ables is possible. For example, 2.6 is not possible. 
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Qualities such as height, weight, temperature, distance, and so forth, how- 
ever, can be measured using fractions or decimals as well: 34.27, 102.26, 
and so forth. These are known as continuous variables. 


Total of probabilities of discrete variables 

As you have noted, the probability of a discrete variable lies somewhere 
between 0 and 1, inclusive. For example, the probability of tossing one 
head in two coin flips is, as earlier, 50. The probability of tossing two heads 
in two coin flips is .25, and the probability of tossing no heads in two coin 
flips is .25. The sum (total) of probabilities for all values of x always equals 
1. For example, note that in Tables 4-3 and 4-5, adding the probabilities 
for all the possible outcomes yields a sum of 1. 


The Binomial 


A discrete variable that can result in only one of two outcomes is called 
binomial. For example, a coin flip is a binomial variable; but drawing a 
card from a standard deck of 52 is not. Whether a drug is either success- 
ful or unsuccessful in producing results is a binomial variable, as is whether 
a machine produces perfect or imperfect widgets. 


Binomial experiments 
Binomial experiments require the following elements: 


mM The experiment consists of a number of identical events (7). 


@ Each event has only one of two mutually exclusive outcomes. (These 
outcomes are called successes and failures.) 


m@ The probability of a success outcome is equal to some percentage, 
which is identified as a proportion, 7 


m@ This proportion, 7, remains constant throughout all events and is 
defined as the ratio of number of successes to number of trials. 


mM The events are independent. 


M Given all of the above, the binomial formula can be applied 
(x = number of favorable outcomes; 7 = number of events): 


P(x = a m(1-m)”"~ 


Aa 
m= n(n— 1)(n— 2)... (3)(2)(1) 
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Example 6: A coin is flipped 10 times. What is the probability of getting 
exactly 5 heads? Using the binomial formula, where 7 (the number of 
events) is given as 10; x (the number of favorable outcomes) is given as 5; 
and the probability of landing a head in one flip is .5: 


P(x) = nm (1-1m)” * 


waa — x)! 


a ee )Q ~ 5) 


_10-9:8:7:6:5:4:3-2°1 
5-4-3-2-16-4-3:2-1) 

=252 (.0313)(.0313) 

=.246 


(.0313)(.0313) 


So the probability of getting exactly 5 heads in 10 flips is .246, or approx- 
imately 25 percent. 


Binomial table 

Because probabilities of binomial variables are so common in statistics, 
tables are used to alleviate having to continually use the formula. Refer to 
Table 1 in Appendix B, and you will find that given m = 10, x = 5, and 
T™ = .5, the probability is .2461. 


Mean and standard deviation 
The mean of the binomial probability distribution is determined by the 
following formula: 


U = mm 


where 7 is the proportion of favorable outcomes and 7 is the number of 
events. 


The standard deviation of the binomial probability distribution is 
determined by this formula: 


o= /nn(1-7) 


Example 7: What is the mean and standard deviation for a binomial prob- 
ability distribution for 10 coin flips of a fair coin? 


48 CliffsQuickReview Statistics 


Because the proportion of favorable outcomes of a fair coin falling heads 
(or tails) is 7 = .5, simply substitute into the formulas: 


W= nm=10(.5)=5 
o= /nn(1 —2) = /10(5)(.5) = /2.5 = 1.58 


The probability distribution for the number of favorable outcomes is 
shown in Figure 4-2. 


Figure 4-2 The binomial probability distribution of the number of heads 
resulting from 10 coin tosses. 
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Note that this distribution appears to display symmetry. Binomial proba- 
bility distributions and use of the normal approximation to replace the 
probability histogram are discussed in Chapter 5. 


Chapter Checkout 


Q&A 
1. A dart board is divided into twenty equal wedges, ignoring the 
bull’s eye. 
a. If only six of the twenty wedges are worth points, what is the prob- 
ability of scoring on one throw (assuming you can hit the board)? 
b. What is the chance of hitting at least one scoring region in three 
consecutive throws? 
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2. During a boring lecture in statistics class, you begin repeatedly flip- 
ping an old, worn quarter. Amazingly, the first ten flips are all heads. 


a. What is the chance of this, if the coin is fair? 

b. Suppose that the coin is not fair. If the probability of heads is 75 
percent rather than 50 percent, what is the chance of your result? 

c. How unfair would the coin have to be in order for your result to 
have had a 50 percent chance of occurring? 


3. A woman has three sons. What is the probability that her next child 
will be a girl? 


4. Consider an ordinary (six-sided) die, assumed to be fair. 


a. Use the binomial distribution to determine the probability that, 
in 10 rolls of the die, you will see exactly 4 sixes. 

b. Ifyou roll the die 100 times, what is the chance of rolling exactly 
40 sixes? 


Answers: la. 0.3 b. 0.657 2a. 9.77x10™ b. 0.0563 c. p(heads) = 0.933 
3. 0.5 4a. 0.0543 b. 1.82x10°* 


Chapter 5 
SAMPLING 


Chapter Check-In 


Examining the differences between populations and samples 


Learning about sampling distributions, sampling errors, and the 
Central Limit Theorem 


Discussing in detail the properties of the normal distribution 


A: discussed in Chapter 1, the field of inferential statistics enables you 
to make educated guesses about the numerical characteristics of large 
groups. The logic of sampling gives you a way to test conclusions about 
such groups using only a small portion of its members. 


Populations, Samples, Parameters, 
and Statistics 
A population is a group of phenomena that have something in common. 
The term often refers to a group of people, as in the following examples: 
All registered voters in Crawford County 
All members of the International Machinists Union 
All Americans who played golf at least once in the past year 


But populations can refer to things as well as people: 


All widgets produced last Tuesday by the Acme Widget Company 
All daily maximum temperatures in July for major U.S. cities 
All basal ganglia cells from a particular rhesus monkey 


Often, researchers want to know things about populations but don’t have 
data for every person or thing in the population. If a company’s customer 
service division wanted to learn whether its customers were satisfied, it would 
not be practical (or perhaps even possible) to contact every individual who 
purchased a product. Instead, the company might select a sample of the 
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population. A sample is a smaller group of members of a population selected 
to represent the population. In order to use statistics to learn things about 
the population, the sample must be random. A random sample is one in 
which every member of a population has an equal chance to be selected. 


A parameter is a characteristic of a population. A statistic is a character- 
istic of a sample. Inferential statistics enables you to make an educated 
guess about a population parameter based on a statistic computed from a 
sample randomly drawn from that population. See Figure 5-1. 


Figure 5-1 Illustration of the relationship between samples and populations. 
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For example, say you want to know the mean income of the subscribers to 
a particular magazine—a parameter of a population. You draw a random 
sample of 100 subscribers and determine that their mean income i is $27,500 
(a statistic). You conclude that the population mean income [J is likely to be 
close to $27,500 as well. This example is one of statistical inference. 


Different symbols are used to denote statistics and parameters, as Table 
5-1 shows. 


Table 5-1 Comparison of Sample Statistics and 
Population Parameters 


Sample Statistic Population Parameter 
Mean x wu 
Standard deviation S, sd o 


Variance s o 
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Sampling Distributions 


Continuing with the earlier example, suppose that ten different samples 
of 100 people were drawn from the population, instead of just one. The 
income means of these ten samples would not be expected to be exactly 
the same, because of sampling variability. Sampling variability is the ten- 
dency of the same statistic computed from a number of random samples 
drawn from the same population to differ. 


Suppose that the first sample of 100 magazine subscribers was “returned” 
to the population (made available to be selected again), another sample of 
100 subscribers selected at random, and the mean income of the new sam- 
ple computed. If this process were repeated ten times, it might yield the 
following sample means: 


27,500 27,192 28,736 26,454 28,527 
28,407 27,592 27,684 28,827 27,809 


These ten values are part of a sampling distribution. The sampling dis- 
tribution of a statistic—in this case, of a mean—is the distribution 
obtained by computing the statistic for a large number of samples drawn 
from the same population. 


You can compute the sample mean of this sampling distribution by sum- 
ming the ten sample means and dividing by ten, which gives a distribu- 
tion mean of 27,872. Suppose that the mean income of the entire 
population of subscribers to the magazine is $28,000. (You usually do not 
know what it is.) You can see in Figure 5-2 that the first sample mean 
($27,500) was not a bad estimate of the population mean and that the 
mean of the distribution of ten sample means ($27,872) was even better. 


Figure 5-2 Estimation of the population mean becomes progressively 
more accurate as more samples are taken. 


Distribution of population 


Distribution of 
first sample of 100 


Distribution of 
means from 10 samples 
(part of a sampling 

distribution) 


28,000 (Actual population mean) 
27,872 (Mean of distribution of 10 means) 


27,500 (Mean of first sample) 
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The more sample means included in the sampling distribution, the more 
accurate the mean of the sampling distribution becomes as an estimate of 
the population mean. 


Random and Systematic Error 


‘Two potential sources of error occur in statistical estimation—two reasons 
a statistic might misrepresent a parameter. Random error occurs as a result 
of sampling variability. The ten sample means in the preceding section dif- 
fered from the true population mean because of random error. Some were 
below the true value; some above it. Similarly, the mean of the distribu- 
tion of ten sample means was slightly lower than the true population mean. 
If ten more samples of 100 subscribers were drawn, the mean of that dis- 
tribution—that is, the mean of those means—might be higher than the 
population mean. 


Systematic error or bias refers to the tendency to consistently underesti- 
mate or overestimate a true value. Suppose that your list of magazine sub- 
scribers was obtained through a database of information about air travelers. 
The samples that you would draw from such a list would likely overesti- 
mate the population mean of all subscribers’ income because lower-income 
subscribers are less likely to travel by air and many of them would be 
unavailable to be selected for the samples. This example would be one of 
bias. See Figure 5-3. 


Figure 5-3 Random (sampling) error and systematic error (bias) distort 
the estimation of population parameters from sample statistics. 


Low variability, low bias Low variability, high bias 
e e 
e.°e e e 
e eee eee e 
eeoeoeeod ecooeoooee 
T T 
True value True value 
High variability, low bias High variability, high bias 
e e 
e ee eee 
eo; eo e e020 0 06 e eo e808 e00080 


T T 
True value True value 
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Central Limit Theorem 


If the population of all subscribers to the magazine were normal, you would 
expect its sampling distribution of means to be normal as well. But what 
if the population were non-normal? The Central Limit Theorem states 
that even if a population distribution is strongly non-normal, its sampling 
distribution of means will be approximately normal for large sample sizes 
(over 30). The Central Limit Theorem makes it possible to use probabil- 
ities associated with the normal curve to answer questions about the means 
of sufficiently large samples. 


According to the Central Limit Theorem, the mean of a sampling distri- 
bution of means is an unbiased estimator of the population mean. 


Ur=p 
Similarly, the standard deviation of a sampling distribution of means is 


es 


Note that the larger the sample, the less variable the sample mean. The 
mean of many observations is less variable than the mean of few. The stan- 
dard deviation of a sampling distribution of means is often called the stan- 
dard error of the mean. Every statistic has a standard error, which is a 
measure of the statistic’s random variability. 


Example 1: If the population mean of number of fish caught per trip to 
a particular fishing hole is 3.2 and the population standard deviation is 
1.8, what are the population mean and standard deviation of 40 trips? 


Properties of the Normal Curve 


Known characteristics of the normal curve make it possible to estimate the 
probability of occurrence of any value of a normally distributed variable. 
Suppose that the total area under the curve is defined to be 1. You can 
multiply that number by 100 and say there is a 100 percent chance that 
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any value you can name will be somewhere in the distribution. (Remember, 
the distribution extends to infinity in both directions.) Similarly, because 
half of the area of the curve is below the mean and half is above it, you can 
say that there’s a 50 percent chance that a randomly chosen value will be 
above the mean and the same chance that it will be below it. 


It makes sense that the area under the normal curve is equivalent to the 
probability of randomly drawing a value in that range. The area is great- 
est in the middle, where the “hump” is, and thins out toward the tails. 
That’s consistent with the fact that there are more values close to the mean 
in a normal distribution than far from it. 


When the area of the standard normal curve is divided into sections by 
standard deviations above and below the mean, the area in each section is 
a known quantity (see Figure 5-4). As explained earlier, the area in each 
section is the same as the probability of randomly drawing a value in that 
range. 


Figure 5-4 The normal curve and the area under the curve between o units. 


Total area = 1.0 


For example, .3413 of the curve falls between the mean and one standard 
deviation above the mean, which means that about 34 percent of all the 
values of a normally distributed variable are between the mean and one 
standard deviation above it. It also means that there is a .3413 chance that 
a value drawn at random from the distribution will lie between these two 
points. 


Sections of the curve above and below the mean may be added together 
to find the probability of obtaining a value within (plus or minus) a given 
number of standard deviations of the mean (see Figure 5-5). For example, 
the amount of curve area between one standard deviation above the mean 
and one standard deviation below is .3413 + .3413 = .6826, which means 
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that approximately 68.26 percent of the values lie in that range. Similarly, 
about 95 percent of the values lie within two standard deviations of the 
mean, and 99.7 percent of the values lie within three standard deviations. 


Figure 5-5 The normal curve and the area under the curve between o units. 
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95% of area 
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In order to use the area of the normal curve to determine the probability 
of occurrence of a given value, the value must first be standardized, or 
converted to a score. To convert a value to a z-score is to express it in 
terms of how many standard deviations it is above or below the mean. After 
the zscore is obtained, its corresponding probability may be looked up in 
a table. The formula to compute a z-score is 


_ xp 
= 
where x is the value to be converted, UU is the population mean, and o is 
the population standard deviation. 


Example 2: A normal distribution of retail store purchases has a mean of 
$14.31 and a standard deviation of 6.40. What percentage of purchases 
were under $10? First, compute the zscore: 


_ 10- 14.31 _ 


o- —Gag .67 
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The next step is to look up the zscore in the table of standard normal 
probabilities (see Table 2 in Appendix B). The standard normal table lists 
the probabilities (curve areas) associated with given z-scores. 


Table 2 from Appendix B gives the area of the curve below <z or, in other 
words, the probability of obtaining a value of z or lower. Not all standard 
normal tables use the same format, however. Some list only positive 
z-scores and give the area of the curve between the mean and z. Such a 
table is slightly more difficult to use, but the fact that the normal curve is 
symmetric makes it possible to use it to determine the probability associ- 
ated with any z-score, and vice versa. 


To use Table 2 (the table of standard normal probabilities) in Appendix B, 
first look up the z-score in the left column, which lists z to the first deci- 
mal place. Then look along the top row for the second decimal place. The 
intersection of the row and column is the probability. In the example, you 
first find —0.6 in the left column and then .07 in the top row. Their inter- 
section is .2514. The answer, then, is that about 25 percent of the pur- 
chases were under $10 (see Figure 5-6). 


Figure 5-6 Finding a probability using a z-score on the normal curve. 


2514 | 


-.67 0 
$10.00 $14.31 


What if you had wanted to know the percentage of purchases above a cer- 
tain amount? Because Table 2 gives the area of the curve below a given z, 
to obtain the area of the curve above z simply subtract the tabled proba- 
bility from 1. The area of the curve above a z of —.67 is 1 —.2514 = .7486. 
Approximately 75% of the purchases were above $10. 


Just as Table 2 can be used to obtain probabilities from zscores, it can be 
used to do the reverse. 
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Example 3: Using the previous example, what purchase amount marks 
the lower 10 percent of the distribution? 


Locate in Table 2 the probability of .1000, or as close as you can find, and 
read off the corresponding zscore. The figure that you seek lies between 
the tabled probabilities of .0985 and .1003, which correspond to zscores 
of —1.29 and —1.28. To be conservative, use the more extreme value of 
z=-—1.29. Now, use the z formula, this time solving for x: 


x- 14.31 __ 
ane ae 


x— 14.31 = (— 1.29)(6.4) 
x=— 8.256 + 14.31 = 6.054 


Approximately 10 percent of the purchases were below $6.05. 


Normal Approximation to the Binomial 


In Chapter 4, which discusses probability, you saw that some variables are 
continuous, that there is no limit to the number of times you could divide 
their intervals into still smaller ones, although you may round them off 
for convenience. Examples include age, height, and cholesterol level. Other 
variables are discrete, or made of whole units with no values between them. 
Some discrete variables are the number of children in a family, the sizes of 
televisions available for purchase, or the number of medals awarded at the 
Olympic Games. 


Chapter 4 also introduces one kind of discrete variable, the binomial vari- 
able. A binomial variable can take only two values, often termed successes 
and failures. Examples include coin flips that come up either heads or tails, 
manufactured parts that either continue working past a certain point or 
do not, and basketball tosses that either fall through the hoop or do not. 


You discovered that the outcomes of binomial trials have a frequency dis- 
tribution, just as continuous variables do. The more binomial outcomes 
there are (for example, the more coins you flip simultaneously), the more 
closely the sampling distribution resembles a normal curve (see Figure 5-7). 
You can take advantage of this fact and use the table of standard normal 
probabilities (Table 2 in Appendix B) to estimate the likelihood of obtain- 
ing a given proportion of successes. You can do this by converting the test 
proportion to a zscore and looking up its probability in the standard nor- 
mal table. 
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Figure 5-7 As the number of trials increases, the binomial distribution 
approaches the normal distribution. 


One hundred trials 
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Standard deviation 
The mean of the normal approximation to the binomial is 
= mm 
and the standard deviation is 


o=/nn(1—-n) 


where 7 is the number of trials and 7 is the probability of success. The 
approximation will be more accurate the larger the 7 and the closer the 
proportion of successes in the population to .5. 
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Example 4: Assuming an equal chance of a new baby being a boy or a girl 
(that is, m = 0.5), what is the likelihood that 60 or more out of the next 
100 births at a local hospital will be boys? 


joes 60 — ((100) (.5)) 
o — /(100)(.5)(1-.5) 


=10).. 
5 


2 


According to Table 2, a z-score of 2 corresponds to a probability of .9772. 
As you can see in Figure 5-8, there is a .9772 chance that there will be 60 
percent or fewer boys, which means that the probability that there will be 
more than 60 percent boys is 1 — .9772 = .0228, or just over 2 percent. If 
the assumption that the chance of a new baby being a girl is the same as 
it being a boy is correct, the probability of obtaining 60 or fewer girls in 
the next 100 births is also .9772. 


Figure 5-8 Finding a probability using a z-score on the normal curve. 


Probability = .0228 
Probability = .9772 


Chapter Checkout 


Q&A 


1. Consider a fair, six-sided die. Using the normal approximation to the 
binomial distribution, determine the probability that in 10 succes- 
sive rolls, there will be exactly 4 sixes. 
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2. The speed limit on the street in front of your house is 40 mph. In 
order to measure compliance with the limit, you decide to measure 
the mean speed of vehicles on the street. Each day after work, you 
measure the speed of 20 cars and take their average. Your data for five 
days is shown in the table below. 


Day Average Speed (mph) 
Monday 38.3 
Tuesday 40.1 
Wednesday 34.2 
Thursday 45.1 
Friday 50.7 


Using this information answer the following questions: 


a. Find the mean and standard deviation of these means. Is it likely 
that drivers comply, on average, with the law? 

b. Assuming the population mean of drivers does comply, what is 
the probability of your results? Take 6 = 7.5 mph. Assume a nor- 
mal distribution of speeds about LL. 


Answers: 1. 0.0518 2a. 41.68, 6.38, yes. b. 0.59 


Critical Thinking 


4. On TV, surveys are often taken during sensational news stories to 
determine the “people’s opinion.” Usually, viewers are told to dial a 
phone number in order to vote yes or no on the issue. These phone 
numbers sometimes require the caller to pay a fee. Comment on this 
practice, in terms of sampling practice. Does it produce a random 
sample? If not, how could it be modified to produce a more random 
sample? Who is most likely to call these numbers? 


Chapter 6 
PRINCIPLES OF TESTING 


Chapter Check-In 


Stating hypotheses and corresponding null hypotheses 


Learning how to apply simple statistical tests to determine if a null 
hypothesis is supported 


O Understanding the types of statistical errors and estimating them 


Calculating test statistics, confidence intervals, and significance 


Mic people assume that the whole point of statistical analysis is to 
prove, or disprove, a certain claim. This chapter introduces you to 
techniques for testing claims based on the available data. These techniques 
are simple yet surprisingly powerful. 


Stating Hypotheses 


One common use of statistics is the testing of scientific hypotheses. First, 
the investigator forms a research hypothesis that states an expectation to 
be tested. Then the investigator derives a statement that is the opposite of 
the research hypothesis. This statement is called the null hypothesis (in 
notation: H,). It is the null hypothesis that is actually tested, not the 
research hypothesis. If the null hypothesis can be rejected, that is taken as 
evidence in favor of the research hypothesis (also called the alternative 
hypothesis, 1, in notation). Because individual tests are rarely conclusive, 
it is usually not said that the research hypothesis has been “proved,” only 
that it has been supported. 


An example of a research hypothesis comparing two groups might be the 
following: 
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Fourth-graders in Elmwood School perform differently in math than 
fourth-graders in Lancaster School. 
Or in notation: A Wi AL, 
or sometimes: Ai Uy, — Us # 0 
The null hypothesis would be: 


Fourth-graders in Elmwood School perform the same in math as 
fourth-graders in Lancaster School. 


In notation: FA: Wy = Py 
or alternatively: Ah: Wi — Wy = 0 


Some research hypotheses are more specific than that, predicting not only 
a difference but a difference in a particular direction: 


Fourth-graders in Elmwood School are better in math than fourth- 
graders in Lancaster School. 


In notation: Ai Ui > Uo 
or alternatively: AH; UWi—U>0 


The accompanying null hypothesis must be equally specific so that all pos- 
sibilities are covered: 


Fourth-graders in Elmwood School are worse in math, or equal to, 
fourth-graders in Lancaster School. 


In notation: Ah: Wy S Ue 


or alternatively: Ah: U;—bW, $0 


The Test Statistic 


Hypothesis testing involves the use of distributions of known area, like the 
normal distribution, to estimate the probability of obtaining a certain value 
as a result of chance. The researcher is usually betting that the probability 
will be low because that means it’s likely that the test result was not a mere 
coincidence but occurred because the researcher’s theory is correct. It could 
mean, for example, that it’s probably not just bad luck but faulty packag- 
ing equipment that caused you to get a box of raisin cereal with only five 
raisins in it. 
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Only two outcomes of a hypothesis test are possible: either the null hypoth- 
esis is rejected, or it is not. You have seen that values from normally dis- 
tributed populations can be converted to zscores and their probabilities 
looked up in Table 2 in Appendix B (see Chapter 5 for a complete dis- 
cussion). The zscore is one kind of test statistic that is used to determine 
the probability of obtaining a given value. In order to test hypotheses, you 
must decide in advance what number to use as a cutoff for whether the 
null hypothesis will be rejected or not. This number is sometimes called 
the critical or tabled value because it is looked up in a table. It represents 
the level of probability that you will use to test the hypothesis. If the com- 
puted test statistic has a smaller probability than that of the critical value, 
the null hypothesis will be rejected. 


For example, suppose you want to test the theory that sunlight helps pre- 
vent depression. One hypothesis derived from this theory might be that 
hospital admission rates for depression in sunny regions of the country are 
lower than the national average. Suppose that you know the national 
annual admission rate for depression to be 17 per 10,000. You intend to 
take the mean of a sample of admission rates from hospitals in sunny parts 
of the country and compare it to the national average. 


Your research hypothesis is: 


The mean annual admission rate for depression from the hospitals 
in sunny areas is less than 17 per 10,000. 


In notation: Af; Wy < 17 per 10,000 
The null hypothesis is: 


The mean annual admission rate for depression from the hospitals 
in sunny areas is equal to or greater than 17 per 10,000. 


In notation: Fy: Wy, 2 17 per 10,000 


Your next step is to choose a probability level for the test. You know that 
the sample mean must be lower than 17 per 10,000 in order to reject the 
null hypothesis, but how much lower? You settle on a probability level of 
95 percent. That is, if the mean admission rate for the sample of sunny hos- 
pitals is so low that the chance of obtaining that rate from a sample selected 
at random from the national population is less than 5 percent, you'll reject 
the null hypothesis and conclude that there is evidence to support the 
hypothesis that exposure to the sun reduces the incidence of depression. 
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Next, you look up the critical zscore—the z-score that corresponds to 
your chosen level of probability—in the standard normal table. It is impor- 
tant to remember which end of the distribution you are concerned with. 
Table 2 in Appendix B lists the probability of obtaining a given zscore or 
lower. That is, it gives the area of the curve below the z-score. Because a 
computed test statistic in the lower end of the distribution will allow you 
to reject your null hypothesis, you look up the z-score for the probability 
(or area) of .05 and find that it is —1.65. If you were hypothesizing that 
the mean in sunny parts of the country is greater than the national aver- 
age, you would have been concerned with the upper end of the distribu- 
tion instead and would have looked up the zscore associated with the 


probability (area) of .95, which is z= 1.65. 


The critical z-score allows you to define the region of acceptance and the 
region of rejection of the curve (see Figure 6-1). If the computed test 
statistic is below the critical z-score, you can reject the null hypothesis and 
say that you have provided evidence in support of the alternative hypoth- 
esis. If it is above the critical value, you cannot reject the null hypothesis. 


Figure 6-1 The z-score defines the boundary of the zones of rejection 
and acceptance. 


Region of 
rejection 


Region of 
acceptance 


Zz -1.65 -1.20 ie) 


Rate (per 10,000): 13 17 


Suppose that the mean admission rate for the sample of hospitals in sunny 
regions is 13 per 10,000 and suppose also that the corresponding z-score 
for that mean is —1.20. The test statistic falls in the region of acceptance; 
so you cannot reject the null hypothesis that the mean in sunny parts of 
the country is significantly lower than the mean in the national average. 
There is a greater than a 5 percent chance of obtaining a mean admission 


66 CliffsQuickReview Statistics 


rate of 13 per 10,000 or lower from a sample of hospitals chosen at ran- 
dom from the national population, so you cannot conclude that your sam- 
ple mean could not have come from that population. 


One- and Two-tailed Tests 


In the previous example, you tested a research hypothesis that predicted 
not only that the sample mean would be different from the population 
mean but that it would be different in a specific direction—it would be 
lower. This test is called a directional or one-tailed test because the region 
of rejection is entirely within one tail of the distribution. 


Some hypotheses predict only that one value will be different from another, 
without additionally predicting which will be higher. The test of such a 
hypothesis is non-directional or two-tailed because an extreme test statis- 
tic in either tail of the distribution (positive or negative) will lead to the 
rejection of the null hypothesis of no difference. 


Suppose that you suspect that a particular class’s performance on a profi- 
ciency test is not representative of those people who have taken the test. 
The national mean score on the test is 74. 


The research hypothesis is: 
The mean score of the class on the test is not 74. 
Or in notation: Hy; W#74 
The null hypothesis is: 
The mean score of the class on the test is 74. 
In notation: Ay: w= 74 


As in the last example, you decide to use a 95 percent probability level for 
the test. Both tests have a region of rejection, then, of five percent, or .05. 
In this example, however, the rejection region must be split between both 
tails of the distribution—.025 in the upper tail and .025 in the lower tail— 
because your hypothesis specifies only a difference, not a direction, as 
shown in Figure 6-2 (a). You will reject the null hypotheses of no differ- 
ence if the class sample mean is either much higher or much lower than 
the population mean of 74. In the previous example, only a sample mean 
much lower than the population mean would have led to the rejection of 
the null hypothesis. 
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Figure 6-2 Comparison of (a) a two-tailed test and (b) a one-tailed test, 
at the same probability level (95%). 


Two tests at the same probability level (95%) 


(a) Two-tailed test 


T T 
z: 1.96 0 1.80 1.96 
Test score: 74 77 


(b) One-tailed test 


T 
z: 0 —-1.651.80 
Test score: 74 77 


The decision of whether to use a one- or a two-tailed test is important 
because a test statistic that falls in the region of rejection in a one-tailed 
test may not do so in a two-tailed test, even though both tests use the same 
probability level. Suppose the class sample mean in your example was 77, 
and its corresponding z-score was computed to be 1.80. Table 2 in 
Appendix B shows the critical z-scores for a probability of .025 in either 
tail to be -1.96 and 1.96. In order to reject the null hypothesis, the test 
statistic must be either smaller than —1.96 or greater than 1.96. It is not, 
so you cannot reject the null hypothesis. Refer to Figure 6-2 (a). 


Suppose, however, you had a reason to expect that the class would perform 
better on the proficiency test than the population, and you did a one-tailed 
test instead. For this test, the rejection region of .05 would be entirely within 
the upper tail. The critical z-value for a probability of .05 in the upper tail 
is 1.65. (Remember that Table 2 in Appendix B gives areas of the curve below 
% so you look up the zvalue for a probability of .95.) Your computed test 
statistic of z= 1.80 exceeds the critical value and falls in the region of rejec- 
tion, so you reject the null hypothesis and say that your suspicion that the 
class was better than the population was supported. See Figure 6-2 (b). 


In practice, you should use a one-tailed test only when you have good rea- 
son to expect that the difference will be in a particular direction. A two- 
tailed test is more conservative than a one-tailed test because a two-tailed 
test takes a more extreme test statistic to reject the null hypothesis. 
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Type I and II Errors 


You have been using probability to decide whether a statistical test pro- 
vides evidence for or against your predictions. If the likelihood of obtain- 
ing a given test statistic from the population is very small, you reject the 
null hypothesis and say that you have supported your hunch that the sam- 
ple you are testing is different from the population. 


But you could be wrong. Even if you choose a probability level of 95 per- 
cent, that means there is a 5 percent chance, or 1 in 20, that you rejected 
the null hypothesis when it was, in fact, correct. You can err in the oppo- 
site way, too; you might fail to reject the null hypothesis when it is, in fact, 
incorrect. These two errors are called Type I and Type II, respectively. Table 
6-1 presents the four possible outcomes of any hypothesis test based on 
(1) whether the null hypothesis was accepted or rejected and (2) whether 
the null hypothesis was true in reality. 


Table 6-1 Types of Statistical Errors 


H, is actually: 
True False 
Test reject H, Type | error correct 
Decision accept H, correct Type II error 


A Type I error is often represented by the Greek letter alpha (0) and a Type 
II error by the Greek letter beta (B). In choosing a level of probability for 
a test, you are actually deciding how much you want to risk committing a 
Type I error—rejecting the null hypothesis when it is, in fact, true. For this 
reason, the area in the region of rejection is sometimes called the alpha level 
because it represents the likelihood of committing a Type I error. 


In order to graphically depict a Type IL or B, error, it is necessary to imag- 
ine next to the distribution for the null hypothesis a second distribution 
for the true alternative (see Figure 6-3). If the alternative hypothesis is actu- 
ally true, but you fail to reject the null hypothesis for all values of the test 
statistic falling to the left of the critical value, then the area of the curve of 
the alternative (true) hypothesis lying to the left of the critical value rep- 
resents the percentage of times that you will have made a Type IJ error. 
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Figure 6-3 Graphical depiction of the relation between Type | and Type II 
errors, and the power of the test. 
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Type | error 
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Type I and Type II errors are inversely related: As one increases, the other 
decreases. The Type I, or & (alpha), error rate is usually set in advance by 
the researcher. The Type II error rate for a given test is harder to know 
because it requires estimating the distribution of the alternative hypothe- 
sis, which is usually unknown. 


A related concept is power—the probability that a test will reject the null 
hypothesis when it is, in fact, false. You can see from Figure 6-3 that power 
is simply 1 minus the Type II error rate (8). High power is desirable. Like 
B, power can be difficult to estimate accurately, but increasing the sample 
size always increases power. 


Significance 


How do you know how much confidence to put in the outcome of a 
hypothesis test? The statistician’s criterion is the statistical significance of 
the test, or the likelihood of obtaining a given result by chance. This con- 
cept has already been spoken of, using several terms: probability, area of 
the curve, Type I error rate, and so forth. Another common representation 
of significance is the letter p (for probability) and a number between 0 and 
1. There are several ways to refer to the significance level of a test, and it 
is important to be familiar with them. All of the following statements, for 
example, are equivalent: 


The finding is significant at the .05 level. 

The confidence level is 95 percent. 

The Type I error rate is .05. 

The alpha level is .05. 

o =.05. 

There is a 95 percent certainty that the result is not due to chance. 
There is a 1 in 20 chance of obtaining this result. 
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The area of the region of rejection is .05. 
The p-value is .05. 
p=.-05. 


The smaller the significance level p, the more stringent the test and the 
greater the likelihood that the conclusion is correct. The significance level 
is usually chosen in consideration of other factors that affect and are 
affected by it, like sample size, estimated size of the effect being tested, and 
consequences of making a mistake. Common significance levels are .05 (1 
chance in 20), .01 (1 chance in 100), and .001 (1 chance in 1,000). 


The result of a hypothesis test, as has been seen, is that the null hypothesis 
is either rejected or not. The significance level for the test is set in advance 
by the researcher in choosing a critical test value. When the computed test 
statistic is large (or small) enough to reject the null hypothesis, however, it 
is customary to report the observed (actual) p-value for the statistic. 


If, for example, you intend to perform a one-tailed (lower tail) test using 
the standard normal distribution at p = .05, the test statistic will have to 
be smaller than the critical z-value of —1.65 in order to reject the null 
hypothesis. But suppose the computed z-score is —2.50, which has an asso- 
ciated probability of .0062. The null hypothesis is rejected with room to 
spare. The observed significance level of the computed statistic is p = .0062; 
so you could report that the result was significant at p < .01. This result 
means that even if you had chosen the more stringent significance level of 
.01 in advance, you would still have rejected the null hypothesis, which is 
stronger support for your research hypothesis than rejecting the null 
hypothesis at p = .05. 


It is important to realize that statistical significance and substantive, or 
practical, significance are not the same thing. A small, but important, 
real-world difference may fail to reach significance in a statistical test. 
Conversely, a statistically significant finding may have no practical conse- 
quence. This finding is especially important to remember when working 
with large sample sizes because any difference can be statistically signifi- 
cant if the samples are extremely large. 


Point Estimates and Confidence Intervals 


You have seen that the sample mean x is an unbiased estimate of the pop- 
ulation mean LW. Another way to say this is that x is the best point estimate 
of the true value of . Some error is associated with this estimate, however— 
the true population mean may be larger or smaller than the sample mean. 
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Instead of a point estimate, you might want to identify a range of possible 
values p might take, controlling the probability that is not lower than the 
lowest value in this range and not higher than the highest value. Such a range 
is called a confidence interval. 


Example 1: Suppose that you want to find out the average weight of all play- 
ers on the football team at Landers College. You are able to select ten play- 
ers at random and weigh them. The mean weight of the sample of players 
is 198, so that number is your point estimate. The population standard devi- 
ation is 6 = 11.50. What is a 90 percent confidence interval for the popu- 
lation weight, if you presume the players’ weights are normally distributed? 


This question is the same as asking what weight values correspond to the 
upper and lower limits of an area of 90 percent in the center of the distri- 
bution. You can define that area by looking up in Table 2 (in Appendix B) 
the z-scores that correspond to probabilities of .05 in either end of the 
distribution. They are —1.65 and 1.65. You can determine the weights that 
correspond to these zscores using the following formula: 


=%t 2-2 

(a b))=xtz Te 

The weight values for the lower and upper ends of the confidence inter- 

val are 192 and 204 (see Figure 6-4). A confidence interval is usually 

expressed by two values enclosed by parentheses, as in (192, 204). Another 

way to express the confidence interval is as the point estimate plus or minus 

a margin of error; in this case, it is 198 + 6 pounds. You are 90 percent 

certain that the true population mean of football player weights is between 
192 and 204 pounds. 


Figure 6-4 The relationship between point estimate, confidence interval, 


and z-score. 
90% confidence 
interval for u 
.90 
.05 05 
T T T 
Zz -1.65 0 1.65 
Weight: 192 198 204 


Point estimate 
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What would happen to the confidence interval if you wanted to be 95 per- 
cent certain of it? You would have to draw the limits (ends) of the inter- 
vals closer to the tails, in order to encompass an area of .95 between them 
instead of .90. That would make the low value lower and the high value 
higher, which would make the interval wider. The width of the confidence 
interval is related to the significance level, standard error, and 7 such that 
the following are true: 


M@ The higher the percentage of accuracy (significance) desired, the wider 
the confidence interval. 


@ The larger the standard error, the wider the confidence interval. 


M@ The larger the x, the smaller the standard error, and so the narrower 
the confidence interval. 


All other things being equal, a smaller confidence interval is always more 
desirable than a larger one because a smaller interval means the popula- 
tion parameter can be estimated more accurately. 


Estimating a Difference Score 


Imagine that instead of estimating a single population mean [, you wanted 
to estimate the difference between two population means [; and U,, such 
as the difference between the mean weights of two football teams. The 
statistic %, — % has a sampling distribution just as the individual means do, 
and the rules of statistical inference can be used to calculate either a point 
estimate or a confidence interval for the difference between the two pop- 
ulation means. 


Suppose you wanted to know which was greater, the mean weight of Landers 
College’s football team or the mean weight of Ingram College’s team. You 
already have a point estimate of 198 pounds for Landers’ team. Suppose that 
you draw a random sample of players from Ingram’s team, and the sample 
mean is 195. The point estimate for the difference between the mean weights 
of Landers’ team ({1;) and Ingram’s team (Uz) is 198 — 195 = 3. 


But how accurate is that estimate? You can use the sampling distribution of 
the difference score to construct a confidence interval for [, — Ly, just as you 
did for Figure 6-4. Suppose that when you do so, you find that the confi- 
dence interval limits are (-3, 9), which means that you are 90 percent cer- 
tain that the mean for the Landers team is between three pounds lighter and 
nine pounds heavier than the mean for the Ingram team (see Figure 6-5). 
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Figure 6-5 The relationship between point estimate, confidence interval, 
and z-score, for a test of the difference of two means. 


90% confidence 


; ——> 
interval for 4 - [5 
90 
05 05 
T T T T 
z: -1.65 0 1.65 
Ly -Mo -3 0 Tl 3 9 


Point estimate 


Suppose that instead of a confidence interval, you want to test the two- 
tailed hypothesis that the two team weights have different means. Your 
null hypothesis would be: 


FAX: Wy = Uy 
or 
At: Wy — Wy = 0 
To reject the null hypothesis of equal means, the test statistic—in this 
example, z-score—for a difference in mean weights of 0 would have to fall 
in the rejection region at either end of the distribution. But you have 
already seen that it does not—only difference scores less than —3 or greater 


than 9 fall in the rejection region. For this reason, you would be unable to 
reject the null hypothesis that the two population means are equal. 


This characteristic is a simple but important one of confidence intervals 
for difference scores. If the interval contains 0, you would be unable to 
reject the null hypothesis that the means are equal at the same significance 
level. 


Univariate Tests: An Overview 


To summarize, hypothesis testing of problems with one variable requires 
carrying out the following steps: 

1. State the null hypothesis and the alternative hypothesis. 

2. Decide on a significance level for the test. 

3. Compute the value of a test statistic. 
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4. Compare the test statistic to a critical value from the appropri- 
ate probability distribution corresponding to your chosen level 
of significance and observe whether the test statistic falls 
within the region of acceptance or the region of rejection. 


Thus far, you have used the test statistic z and the table of standard nor- 
mal probabilities (Table 2 in Appendix B) to carry out your tests. There 
are other test statistics and other probability distributions. The general for- 
mula for computing a test statistic for making an inference about a single 
population is 


observed sample statistic — tested value 


test statistic = 
standard error 

where observed sample statistic is the statistic of interest from the sample 
(usually the mean), tested value is the hypothesized population parameter 
(again, usually the mean), and standard error is the standard deviation of 
the sampling distribution divided by the positive square root of 7. 


The general formula for computing a test statistic for making an inference 
about a difference between two populations is 


statistic, — statistic, — tested value 
standard error 


test statistic = 


where statistic; and statistic, are the statistics from the two samples (usu- 
ally the means) to be compared, tested value is the hypothesized difference 
between the two population parameters (0 if testing for equal values), and 
standard error is the standard error of the sampling distribution, whose for- 
mula varies according to the type of problem. 


The general formula for computing a confidence interval is 


(a, 6) = *% + test statistic: standard error 


where a and bare the lower and upper limits of the confidence interval, x 
is the point estimate (usually the sample mean), ‘est statistic is the critical 
value from the table of the appropriate probability distribution (upper or 
positive value if z) corresponding to half of the desired alpha level, and 
standard error is the standard error of the sampling distribution. 


Why must the alpha level be halved before looking up the critical value 
when computing a confidence interval? Because the rejection region is split 
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between both tails of the distribution, as in a two-tailed test. For a confi- 
dence interval at 0 = .05, you would look up the critical value corresponding 
to an upper-tailed probability of .025. 


Chapter Checkout 


Q&A 


1. An advertiser wants to know if the average age of people watching a 
particular TV show regularly is less than 24 years. 


a. State the alternative and null hypotheses. 

b. Is this a one- or two-tailed test? 

c. A random survey of 50 viewers determines that their mean age 
is 19 years, with a standard deviation of 1.7 years. Find the 90 
percent confidence interval of the age of the viewers. 

d. What is the significance level for rejecting the null hypothesis? 


2. True or False: Statistical tests should always be performed on a null 
hypothesis. 


3. True or False: A result with a high level of significance is always very 
important. 


Answers: la. H; UW < 24; Ay: Ub = 24 b. one c. (18.6, 19.4) d. .0016 
2. True 3. False 


Critical Thinking 


1. Discuss the relationship between statistical tests and the assumption 
of causality. For example, suppose a researcher shows that, to a level 
& = 0.001, the students at Middletown High are shorter than the stu- 
dents at Clearview High. Would you say that going to Middletown 
High causes you to be short? What, if anything, would you conclude 
if research showed, to & = 0.001, that people living near a power plant 
die at a younger age than people living elsewhere? 


Chapter 7 
UNIVARIATE INFERENTIAL TESTS 


Chapter Check-In 


Learning about several statistical tests, including the z-test and ¢test 


Calculating confidence levels and confidence intervals for each test 


Applying these tests to examples 


hapter 6 explained how to formulate hypotheses and test them with 

the z-test. In Chapter 6, you also saw how a general test statistic is 
constructed and can be used to test a hypothesis against any probability 
distribution. 


In this chapter, you take a closer look at some of the most common sta- 
tistical tests used in scientific and sociological research: the z- and ¢tests. 
These tests are used to test hypotheses concerning population means or 
proportions. Each test is explained in a separate section with a simple for- 
mat so that you can easily look up how to calculate the appropriate test 
statistic. The formula for finding the confidence interval of your data is 
given as well. Most useful of all, each section contains several simple exam- 
ples to illustrate how to apply the test to your data. 


One-sample z-test 
Requirements: normally distributed population, 6 known 
Test for population mean 


Hypothesis test 


Formula: Z= 
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where x is the sample mean, A is a specified value to be tested, 6 is the 
population standard deviation, and 7 is the size of the sample. Look up 
the significance level of the z-value in the standard normal table (Table 2 
in Appendix B). 


Example 1 (one-tailed test): A herd of 1,500 steers was fed a special high- 
protein grain for a month. A random sample of 29 were weighed and had 
gained an average of 6.7 pounds. If the standard deviation of weight gain 
for the entire herd is 7.1, what is the likelihood that the average weight 
gain per steer for the month was at least 5 pounds? 


null hypothesis: =Hy:<5 


alternative hypothesis: H,:11 25 


tabled value for z< 1.28 is .8997 
1 — .8997 = .1003 


So the probability that the herd gained at least 5 pounds per steer is p < 
.1003. Should the null hypothesis of a weight gain of less than 5 pounds 
for the population be rejected? That depends on how conservative you 
want to be. If you had decided beforehand on a significance level of p < 
.05, the null hypothesis could not be rejected. 


Example 2 (two-tailed test): In national use, a vocabulary test is known 
to have a mean score of 68 and a standard deviation of 13. A class of 19 
students takes the test and has a mean score of 65. 


Is the class typical of others who have taken the test? Assume a significance 
level of p< .05. 


There are two possible ways that the class may differ from the population. 
Its scores may be lower than, or higher than, the population of all students 
taking the test; therefore, this problem requires a two-tailed test. First, state 
the null and alternative hypotheses: 


null hypothesis: 1: U = 68 
alternative hypothesis: : U4 68 


Because you have specified a significance level, you can look up the criti- 
cal z-value in Table 2 of Appendix B before computing the statistic. This 
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is a two-tailed test; so the .05 must be split such that .025 is in the upper 
tail and another .025 in the lower. The z-value that corresponds to —.025 
is —1.96, which is the lower critical z-value. The upper value corresponds 
to 1 — .025, or .975, which gives a z-value of 1.96. The null hypothesis of 
no difference will be rejected if the computed z statistic falls outside of the 
range of —1.96 to 1.96. 


Next, compute the z statistic: 


Because —1.006 is between —1.96 and 1.96, the null hypothesis of popu- 
lation mean is 68 and cannot be rejected. That is, the class can be con- 


sidered as typical of others who have taken the test. 


Confidence interval for population mean using z 


Formula: (4 |) = Zan: [- 
n 

where a and bare the limits of the confidence interval, x is the sample 

mean, 7 is the upper (or positive) z-value from the standard normal table 

corresponding to half of the desired alpha level (because all confidence 

intervals are two-tailed), © is the population standard deviation, and 7 is 

the size of the sample. 


Example 3: A sample of 12 machine pins has a mean diameter of 1.15 
inches, and the population standard deviation is known to be .04. What 
is a 99 percent confidence interval of diameter width for the population? 


First, determine the z-value. A 99 percent confidence level is equivalent 
to p< .01. Half of .01 is .005. The z-value corresponding to an area of 
.005 is 2.58. The interval may now be calculated: 
(a, 6) = 1.15 42.58 04 
12 
=1.15 + .03 
=(1.12, 1.18) 


It is 99 percent certain that the population mean of pin diameters lies 
between 1.12 and 1.18 inches. Note that this is not the same as saying 
that 99 percent of the machine pins have diameters between 1.12 and 
1.18 inches, which would be an incorrect conclusion from this test. 
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Choosing a sample size 

Because surveys cost money to administer, researchers often want to cal- 
culate how many subjects will be needed to determine a population mean 
using a fixed confidence interval and significance level. The formula is 


2 
n= (2°) 


where 7 is the number of subjects needed, zen is the critical z-value corre- 
sponding to the desired significance level, 6 is the population standard 
deviation, and w is the desired confidence interval width. 


Example 4: How many subjects will be needed to find the average age of 
students at Fisher College plus or minus a year, with a 95 percent signifi- 
cance level and a population standard deviation of 3.5? 


n= (QUP9C5)) "= (1872) - 47.06 


Rounding up, a sample of 48 students would be sufficient to determine 
students’ mean age plus or minus one year. Note that the confidence inter- 
val width is always double the “plus or minus” figure. 


One-sample f-test 
Requirements: normally distributed population, 6 is unknown 
Test for population mean 


Hypothesis test 


Formula: t= 


where x is the sample mean, A is a specified value to be tested, s is the sam- 
ple standard deviation, and 7 is the size of the sample. When the standard 
deviation of the sample is substituted for the standard deviation of the pop- 
ulation, the statistic does not have a normal distribution; it has what is 
called the distribution (see Table 3 in Appendix B). Because there is a 
different t-distribution for each sample size, it is not practical to list a sep- 
arate area-of-the-curve table for each one. Instead, critical t-values for com- 
mon alpha levels (.05, .01, .001, and so forth) are usually given in a single 
table for a range of sample sizes. For very large samples, the ¢-distribution 
approximates the standard normal (z) distribution. 
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Values in the ¢-table are not actually listed by sample size but by degrees 
of freedom (df). The number of degrees of freedom for a problem involv- 
ing the distribution for sample size 7 is simply 2 — 1 for a one-sample 
mean problem. 


Example 5 (one-tailed test): A professor wants to know if her introduc- 
tory statistics class has a good grasp of basic math. Six students are chosen 
at random from the class and given a math proficiency test. The professor 
wants the class to be able to score at least 70 on the test. The six students 
get scores of 62, 92, 75, 68, 83, and 95. Can the professor be at least 90 
percent certain that the mean score for the class on the test would be at 
least 70? 


null hypothesis: 1): u < 70 
alternative hypothesis: H,,: = 70 


First, compute the sample mean and standard deviation (see Chapter 2). 


62 
92 
7 475 
68 
83 5 
95 
475 


Next, compute the #value: 


IAG = DO 97 ©. 
as ly Aaa ae 


J6 


To test the hypothesis, the computed #value of 1.71 will be compared to 
the critical value in the +table. But which do you expect to be larger and 
which smaller? One way to reason about this is to look at the formula and 
see what effect different means would have on the computation. If the 
sample mean had been 85 instead of 79.17, the resulting value would 
have been larger. Because the sample mean is in the numerator, the larger 
it is, the larger the resulting figure will be. At the same time, you know 
that a higher sample mean will make it more likely that the professor will 
conclude that the math proficiency of the class is satisfactory and that the 
null hypothesis of less-than-satisfactory class math knowledge can be 
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rejected. Therefore, it must be true that the larger the computed t-value, 
the greater the chance that the null hypothesis can be rejected. It follows, 
then, that if the computed ¢-value is larger than the critical value from 
the table, the null hypothesis can be rejected. 


A 90 percent confidence level is equivalent to an alpha level of .10. Because 
extreme values in one rather than two directions will lead to rejection of 
the null hypothesis, this is a one-tailed test, and you do not divide the alpha 
level by 2. The number of degrees of freedom for the problem is 6 — 1 = 
5. The value in the ¢table for 10,5 is 1.476. Because the computed ¢-value 
of 1.71 is larger than the critical value in the table, the null hypothesis can 
be rejected, and the professor can be 90 percent certain that the class mean 
on the math test would be at least 70. 


Note that the formula for the one-sample ¢-test for a population mean is 
the same as the z-test, except that the ¢test substitutes the sample stan- 
dard deviation s for the population standard deviation 6 and takes critical 
values from the ¢-distribution instead of the g-distribution. The ¢distri- 
bution is particularly useful for tests with small samples (7 < 30). 


Example 6 (two-tailed test): A Little League baseball coach wants to know 
if his team is representative of other teams in scoring runs. Nationally, the 
average number of runs scored by a Little League team in a game is 5.7. 
He chooses five games at random in which his team scored 5, 9, 4, 11, and 
8 runs. Is it likely that his team’s scores could have come from the national 
distribution? Assume an alpha level of .05. 


Because the team’s scoring rate could be either higher than or lower than 
the national average, the problem calls for a two-tailed test. First, state the 
null and alternative hypotheses: 


null hypothesis: 1: u = 5.7 
alternative hypothesis: H,; U #5.7 


Next compute the sample mean and standard deviation: 


5 

9 

4 r= 32-74 
11 s= 2.88 

8 


Qo 
N 
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Next, the value: 


Now, look up the critical value from the ¢-table (Table 3 in Appendix B). 
You need to know two things in order to do this: the degrees of freedom 
and the desired alpha level. The degrees of freedom is 5 — 1 = 4. The over- 
all alpha level is .05, but because this is a two-tailed test, the alpha level 
must be divided by two, which yields .025. The tabled value for t95,4 is 
2.776. The computed ¢ of 1.32 is smaller than the ¢ from Table 3, so you 
cannot reject the null hypothesis that the mean of this team is equal to the 
population mean. The coach can conclude that his team fits in with the 
national distribution on runs scored. 


Confidence interval for population mean using t 


Formula: (aD) =%+ ton, ¢° = 
Jn 


where aand are the limits of the confidence interval, % is the sample 
mean, fy ap is the value from the table corresponding to half of the desired 
alpha level at 7— 1 degrees of freedom, s is the sample standard deviation, 
and 7 is the size of the sample. 


Example 7: Using the previous example, what is a 95 percent confidence 
interval for runs scored per team per game? 


First, determine the value. A 95 percent confidence level is equivalent to 
an alpha level of .05. Half of .05 is .025. The #value corresponding to an 
area of .025 at either end of the ¢-distribution for 4 degrees of freedom 
(t95,4) is 2.776. The interval may now be calculated: 


(a,b) = 5.7 £2.78 ae 


=5.7 + 3.58 
= (2.12, 9.28) 


The interval is fairly wide, mostly because 7 is small. 
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Two-sample z-test for Comparing 
Two Means 


Requirements: two normally distributed but independent populations, 6 
is known 


Hypothesis test 


%-%,-A 

Formula: g= 
O; , 9; 
V7 +H 


where x, and X, are the means of the two samples, A is the hypothesized 
difference between the population means (0 if testing for equal means), 6, 
and ©, are the standard deviations of the two populations, and ,and 7) 
are the sizes of the two samples. 


Example 8 (two-tailed test): The amount of a certain trace element in 
blood is known to vary with a standard deviation of 14.1 ppm (parts per 
million) for male blood donors and 9.5 ppm for female donors. Random 
samples of 75 male and 50 female donors yield concentration means of 28 
and 33 ppm, respectively. What is the likelihood that the population means 
of concentrations of the element are the same for men and women? 


Null hypothesis: A: LW; = Ha 

or fh: LW, — = 0 

alternative hypothesis: H; U, 4 Uy 
or: Hz Wy,— p40 


— _28- 33-0 _ 5 


140,95? J265+181_ 
75 * 50 


= 237 


The computed z-value is negative because the (larger) mean for females 
was subtracted from the (smaller) mean for males. But because the hypoth- 
esized difference between the populations is 0, the order of the samples in 
this computation is arbitrary—x, could just as well have been the female 
sample mean and x, the male sample mean, in which case z would be 2.37 
instead of —2.37. An extreme z-score in either tail of the distribution (plus 
or minus) will lead to rejection of the null hypothesis of no difference. 
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The area of the standard normal curve corresponding to a z-score of —2.37 
is .0089. Because this test is two-tailed, that figure is doubled to yield a 
probability of .0178 that the population means are the same. If the test 
had been conducted at a pre-specified significance level of  < .05, the null 
hypothesis of equal means could be rejected. If the specified significance 
level had been the more conservative (more stringent) 0 < .01, however, 
the null hypothesis could not be rejected. 


In practice, the two-sample z-test is not often used because the two pop- 
ulation standard deviations 6, and 6; are usually unknown. Instead, sam- 
ple standard deviations and the ¢distribution are used. 


Two-sample t-test for Comparing 
Two Means 


Requirements: two normally distributed but independent populations, 6 
is unknown 


Hypothesis test 
%,— %—A 
Formula: i SS 
St je 82 
Mm" My 


where x, and X, are the means of the two samples, A is the hypothesized 
difference between the population means (0 if testing for equal means), 5, 
and s)are the standard deviations of the two samples, and 7, and mare the 
sizes of the two samples. The number of degrees of freedom for the prob- 
lem is the smaller of 2-1 and 7)— 1. 


Example 9 (one-tailed test): An experiment is conducted to determine 
whether intensive tutoring (covering a great deal of material in a fixed amount 
of time) is more effective than paced tutoring (covering less material in the 
same amount of time). Two randomly chosen groups are tutored separately 
and then administered proficiency tests. Use a significance level of a < .05. 


null hypothesis: =: Uy, < be 

or fh: MW, -b. <0 

alternative hypothesis: H,: UW, > Ln 
or: Az Uy — Ly > 0 
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Group Method =n x 5 
1 intensive 12 46.31 6.44 
Z paced 10 42.79 Dae 
46.31 — 42.79 — 0 3.52 
t= = = 1.166 
6.44" 752° /3.46 + 5.66 
12 10 


The degrees of freedom parameter is the smaller of (12 — 1) and (10 - 1), 
or 9. Because this is a one-tailed test, the alpha level (.05) is not divided 
by two. The next step is to look up £95,5in the ¢-table (Table 3 in Appendix 
B), which gives a critical value of 1.833. The computed t of 1.166 does 
not exceed the tabled value, so the null hypothesis cannot be rejected. This 
test has not provided statistically significant evidence that intensive tutor- 
ing is superior to paced tutoring. 


Confidence interval for comparing two means 


Formula: (4 6) =%,—%,£ tan, ay oH + > 

where aand dare the limits of the confidence interval, %, and %, are the 
means of the two samples, f/2,,ris the value from the ¢table correspond- 
ing to half of the desired alpha level, s; and 5, are the standard deviations 
of the two samples, and mand mare the sizes of the two samples. The 
degrees of freedom parameter for looking up the ¢-value is the smaller of 


m—1and m-—1. 


Example 10: Estimate a 90 percent confidence interval for the difference 
between the number of raisins per box in two brands of breakfast cereal. 


Brand n % 5 
A 6 102.1 12.3 
B 9 93.6 7.52 


The difference between x, and xX, is 102.1 — 93.6 = 8.5. The degrees of 
freedom is the smaller of (6 — 1) and (9 — 1), or 5. A 90 percent confidence 
interval is equivalent to an alpha level of .10, which is then halved to give 
.05. According to Table 3, the critical value for tos; is 2.015. The interval 
may now be computed. 
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(a, b) = 8.5 + 2.015 - ee ce 


=8.5 + 2.015: / 25.22 + 6.28 
=8.5+11.31 
=(-— 2.81, 19.81) 


You can be 90 percent certain that Brand A cereal has between 2.81 fewer 
and 19.81 more raisins per box than Brand B. The fact that the interval 
contains 0 means that if you had performed a test of the hypothesis that 
the two population means are different (using the same significance level), 
you would not have been able to reject the null hypothesis of no difference. 


Pooled variance method 

If the two population distributions can be assumed to have the same 
variance—and therefore the same standard deviation—-s, and s, can be 
pooled together, each weighted by the number of cases in each sample. 
Although using pooled variance in a ¢test is generally more likely to yield 
significant results than using separate variances, it is often hard to know 
whether the variances of the two populations are equal. For this reason, 
the pooled variance method should be used with caution. The formula for 
the pooled estimator of 0” is 


_ (m= 1) 57+ (m- 1) s; 
n+ n,—2 


where s, and s,are the standard deviations of the two samples and m, and 
nyare the sizes of the two samples. 


The formula for comparing the means of two populations using pooled 
variance is 


X,—x,—-A 
2f1 1 
5, (sr + x) 
where %, and x, are the means of the two samples, A is the hypothesized 
difference between the population means (0 if testing for equal means), 5; 


is the pooled variance, and m,and mare the sizes of the two samples. The 
number of degrees of freedom for the problem is 


df= nm, + m,—2 
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Example 11 (two-tailed test): Does right- or left-handedness affect how 
fast people type? Random samples of students from a typing class are given 
a typing speed test (words per minute), and the results are compared. 
Significance level for the test: .10. Because you are looking for a difference 
between the groups in either direction (right-handed faster than left, or 
vice versa), this is a two-tailed test. 


null hypothesis: 1%: UW, = Us 

or: fo: UW, — Ws = 0 

alternative hypothesis: H; U, # Uy 
or: As Uy—-lW,4#0 


Group -handed n x 5 
1 right 16 55.8 Dal 
2 left 9 59.3 4.3 


First, calculate the pooled variance: 
»_ (16-1)5.7°+(9 = 1) 4.37 
a 16) =2 


= 487.35 + 147.92 
= 23 


S 


=27.62 
Next, calculate the #-value: 


55.8 -59.3-0 _ -3.5 _ 


[27:02( 46 + 5) /4.80 


16 9 


t= — 1.598 


The degrees-of-freedom parameter is 16 + 9 — 2, or 23. This test is a two- 
tailed one, so you divide the alpha level (.10) by two. Next, you look up 
tos3in the table (Table 3 in Appendix B), which gives a critical value of 
1.714. This value is larger than the absolute value of the computed ¢ of 
—1.598, so the null hypothesis of equal population means cannot be 
rejected. There is no evidence that right- or left-handedness has any effect 


on typing speed. 


88 CliffsQuickReview Statistics 


Paired Difference t-test 
Requirements: a set of paired observations from a normal population 


This t-test compares one set of measurements with a second set from the 
same sample. It is often used to compare “before” and “after” scores in 
experiments to determine whether significant change has occurred. 


Hypothesis test 


Formula: t= 


where x is the mean of the change scores, A is the hypothesized difference 
(0 if testing for equal means), sis the sample standard deviation of the dif 
ferences, and zis the sample size. The number of degrees of freedom for the 
problem is 7-1. 


Example 12 (one-tailed test): A farmer decides to try out a new fertilizer 
on a test plot containing 10 stalks of corn. Before applying the fertilizer, he 
measures the height of each stalk. Two weeks later, he measures the stalks 
again, being careful to match each stalk’s new height to its previous one. 
The stalks would have grown an average of six inches during that time even 
without the fertilizer. Did the fertilizer help? Use a significance level of .05. 


null hypothesis: A): A<6 

alternative hypothesis: H,; A > 6 

Stalk I 2 3 A 5 Gs. fe =D 10 
Before height 35.5 31.7 31.2 36.3 22.8 28.0 24.6 26.1 34.5 27.7 
After height 45.3 36.0 38.6 44.7 31.4 33.5 28.8 35.8 42.9 35.0 


Subtract each stalk’s “before” height from its “after” height to get the change 
score for each stalk; then compute the mean and standard deviation of the 
change scores and insert these into the formula. 


45.3 — 35.5 =9.8 
36.0 —31.7= 4.3 = 2 = 2 oe 2 
38.6 —-31.2=74 2s (9.8 — 7.8)" + (4.3 = 7.36) +...4+ (7.3 — 7.36) _ 4.216 


44.7 — 36.3 = 8.4 v10—1 


31.4 - 22.8 = 8.6 s= /4.216 


33.5 — 28.0 = 5.5 5 oe 

28.8 — 24.6 = 4.2 

35.8 — 26.1 =9.7 ee —4b5 ee 136 = 2.098 
42,9 — 34.5 = 8.4 2 oe 

35.0 227-7293 /10 


73.6/10=7.36 
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The problem has 7— 1, or 10 — 1 = 9 degrees of freedom. The test is one- 
tailed because you are asking only whether the fertilizer increases growth, 
not reduces it. The critical value from the ¢-table for ¢5,5 is 1.833. Because 
the computed ¢-value of 2.098 is larger than 1.833, the null hypothesis 
can be rejected. The test has provided evidence that the fertilizer caused 
the corn to grow more than if it had not been fertilized. The amount of 
actual increase was not large (1.36 inches over normal growth), but it was 
statistically significant. 


Test for a Single Population Proportion 


Requirements: binomial population, sample mm) 2 10, and sample 
m1 — To) 2 10, where My is the hypothesized proportion of successes in the 
population. 


Hypothesis test 


= Tt — To 
Formula: z [r= 0) 
n 


where tt is the sample proportion, 7 is the hypothesized proportion, and 
n is the sample size. Because the distribution of sample proportions is 
approximately normal for large samples, the z statistic is used. The test is 
most accurate when 7 (the population proportion) is close to .5 and least 
accurate when T is close to 0 or 1. 


Example 13 (one-tailed test): The sponsors of a city marathon have been 
trying to encourage more women to participate in the event. A sample of 
70 runners is taken, of which 32 are women. The sponsors would like to 
be 90 percent certain that at least 40 percent of the participants are women. 
Were their recruitment efforts successful? 


null hypothesis: Ah: 1 <.4 
alternative hypothesis: Hh: 1 = .4 


The proportion of women runners in the sample is 32 out of 70, or 45.7 
percent. The z-value may now be calculated: 


_ _.457-.40  _ .057 = 973 


. eae - 40) /.003 
70 
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From the ztable, you find that the probability of a z-value less than .97 
is .834, not quite the .90 required to reject the null hypothesis, so it 
cannot be concluded at that level of significance that the population of 
runners is at least 40 percent women. 


Confidence interval for a single population proportion 


‘ [n(1-t) 
Formula: T+ Zen’ ( . 


where i is the sample proportion, %,) is the upper z-value corresponding 
to half of the desired alpha level, and 7 is the sample size. 


Example 14: A sample of 100 voters selected at random in a congressional 
district prefer Candidate Smith to Candidate Jones by a ratio of 3 to 2. 
What is a 95 percent confidence interval of the percentage of voters in the 
district who prefer Smith? 


A ratio of 3 to 2 is equivalent to a proportion of 3/5 = .60. A 95 percent 
confidence interval is equivalent to an alpha level of .05, half of which 
is .025. The critical z-value corresponding to an upper probability of 
1 — .025 is 1.96. The interval may now be computed: 


/.60(1 — .60 
(4 b) = 60+ 1.96 one) 
=.60 + 1.96 /.002 


=.60 + .096 
=(.504,.696) 


It is 95 percent certain that between 50.4 percent and 69.6 percent of the 
voters in the district prefer Candidate Smith. Note that the problem could 
have been figured for Candidate Jones by substituting the proportion .40 
for Smith’s proportion of .60. 


Choosing a sample size 

In the previous problem, you estimated that the percentage of voters in 
the district who prefer Candidate Smith is 60 percent plus or minus about 
10 percent. Another way to say this is that the estimate has a “margin of 
error” of +10 percent, or a confidence interval width of 20 percent. That’s 
a pretty wide range. You may wish to make the margin smaller. 
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Because the width of the confidence interval decreases at a known rate as 
the sample size increases, it is possible to determine the sample size needed 
to estimate a proportion with a fixed confidence interval. The formula is 


n= (22s2)". px (1-pr) 


where 7 is the number of subjects needed, a2 is the z-value correspond- 
ing to half of the desired significance level, w is the desired confidence 
interval width, and p is an estimate of the true population proportion. A 
p* of .50 will result in a higher » than any other proportion estimate but 
is often used when the true proportion is not known. 


Example 15: How large a sample is needed to estimate the preference of 
district voters for Candidate Smith with a margin of error of + 4 percent, 
at a 95 percent significance level? 


You will conservatively estimate the (unknown) true population propor- 
tion of preference for Smith at .50. If it’s really larger (or smaller) than that, 
you will overestimate the size of the sample needed, but px = .50 is play- 
ing it safe. 


ie 2 
n=(2 8) -50(1—.50) 


=49*- 25 
=600.25 


A sample of about 600 voters would be needed to estimate the percentage 
of voters in the district who prefer Smith and to be 95 percent certain that 
the estimate is within (plus or minus) 4 percent of the true population 
percentage. 


Test for Comparing Two Proportions 


Requirements: two binomial populations, 1,25 and 2 (1-1) 25 (for 
each sample), where % is the hypothesized proportion of successes in the 
population. 


Difference test 


Hypothesis test 


Formula: 
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where T= 


and where 7, and m, are the sample proportions, A is their hypothesized 
difference (0 if testing for equal proportions), and mare the sample sizes, 
and x, and x, are the number of “successes” in each sample. As in the test 
for a single proportion, the z distribution is used to test the hypothesis. 


Example 16 (one-tailed test): A swimming school wants to determine 
whether a recently hired instructor is working out. Sixteen out of 25 of 
Instructor A’s students passed the lifeguard certification test on the first 
try. In comparison, 57 out of 72 of more experienced Instructor B’s stu- 
dents passed the test on the first try. Is Instructor A’s success rate worse 
than Instructor B’s? Use o = .10. 


null hypothesis: : 1, 27, 
alternative hypothesis: H,; 1, < T 


First, you need to compute the values for some of the terms in the for- 
mula. The sample proportion m is 16/25 = .640. The sample proportion 
Tis 57/72 = .792. Next, compute 7: 


ie 16+57 _ 73 _ 


eS ae) aa 
Finally, the main formula: 
640 — 792-0 
“T7501 -.753)(35 +75) 
peta Se De = 2 SDS 25489 


/(.186)(.054) 100 


The standard normal (z) table shows that the lower critical z-value for & = 
.10 is approximately —1.28. The computed z must be lower than —1.28 in 
order to reject the null hypothesis of equal proportions. Because the com- 
puted zis —1.517, the null hypothesis can be rejected. It can be concluded 
(at this level of significance) that Instructor A’s success rate is worse than 
Instructor B’s. 
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Confidence interval for comparing two proportions 


Formula: (4 6) =%,— + Zan s(D) 


m(1—7 m(1—1% 
na), B08) 


where s(D) ay 


and where aand bare the limits of the confidence interval of 1%, — 1, 1% 
and 7, are the sample proportions, %,) is the upper z-value corresponding 
to half of the desired alpha level, and 1 and 7 are the sizes of the two 
samples. 


Example 17: A public health researcher wants to know how two high 
schools, one in the inner city and one in the suburbs, differ in the percentage 
of students who smoke. A random survey of students gives the following 
results: 


Population n Smokers 
1 (inner-city) 125 47 
2. (suburban) 153 52 


What is a 90 percent confidence interval for the difference between the 
smoking rates in the two schools? 


The proportion of smokers in the inner-city school is t, = 47/125 = .376. 
The proportion of smokers in the suburban school is 1) =52/153 =.340. 
Next solve for s(D): 


_ [376(1— 376) _ 340(1 ~ 340) 
s(D) 5. 158 


235, 224 _ = 
195 + 753 77 .003 = .058 
A.90 percent confidence interval is equivalent to = .10, which is halved 


to give .05. The upper tabled value for zo; is 1.65. The interval may now 
be computed: 


(4, 6) = (.376 — .340) + (1.65) (.058) 
=.036 + .096 
=(—.060,. 132) 
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The researcher can be 90 percent certain that the true population pro- 
portion of smokers in the inner-city high school is between 6 percent lower 
and 13.2 percent higher than the proportion of smokers in the suburban 
high school. Thus, since the confidence interval contains zero, there is no 
difference between the two types of schools at & = .10. 


Chapter Checkout 


Q&A 


1. Suppose you count the number of days in May on which it rains in 
your town and find it to be 15 (out of 31). Checking the weather 
database, you find that for your town, the average number of rainy 
days in May is 10. 

a. What type of test would you use to determine if this May was 
unusually wet to & = 0.05? 

b. Perform the test. What is the value of your test statistic (that is, 
zor pf? 


2. A biologist believes that polluted water may be causing frogs in a lake 
to be smaller than normal. In an unpolluted lake, she measures 15 
adult frogs and finds a mean length of 7.6 inches with a standard 
deviation of 1.5 inches. In the polluted lake, she measures 23 adult 
frogs and finds a mean of 6.5 inches and a standard deviation of 2.3 
inches. 

a. State the null and alternative hypotheses. 
b. Calculate the appropriate test statistic. 


3. A high school track coach makes the sprinters on his team begin 
weight lifting, in the hopes that their speed will increase. Before 
beginning, each runs a 40 yard dash. After a month of training, each 
runs the dash again. Their times are given below. 


Runner 1 2 3 4 5 6 7 8 9 10 
Before training 4.8 5.1 5.5 49 5.6 6.0 5.8 5.3 6.1 4.7 
After training 4.7 5.0 5.1 5.0 5.2 5.1 5.6 5.1 5.5 5.0 
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a. State the alternative and null hypotheses. 
b. To what level can the coach be sure that his runners have 
improved? 


Answers: la. test for single population proportion b. 1.92 
2a. H,: Upoi< Hunpols H,: Moi Hunpol b. 1.784 3a. H,: Lafeer < Ubefores H: Water 
2 Ubefore b. p< .025 


Chapter 8 
BIVARIATE RELATIONSHIPS 


Chapter Check-In 


Checking for correlation between two variables 


Learning about regression 


1 Using the chi-square (”) test to determine whether two variables are 
independent 


S: far, you have been working with problems involving a single vari- 
able. Many times, however, you want to know something about the 
relationship between two variables. 


Correlation 


Consider Table 8-1, which contains measurements on two variables for ten 
people: the number of months the person has owned an exercise machine 
and the number of hours the person spent exercising in the past week. 


Table 8-1 Exercise Data for Ten People 

Person 7 2 3 4 5 6 7 8 9 10 
Months Owned 5 10 4 8 2 7 9 6 1 12 
Hours Exercised 5 2 8 3 8 5 5 7 10 3 


If you display these data pairs as points in a scatter plot (see Figure 8-1), 
you can see a definite trend. The points appear to form a line that slants 
from the upper left to the lower right. As you move along that line from 
left to right, the values on the vertical axis (hours of exercise) get smaller, 
while the values on the horizontal axis (months owned) get larger. Another 
way to express this is to say that the two variables are inversely related: 
The more months the machine was owned, the less the person tended to 
exercise. 
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Figure 8-1 The data in Table 8-1 is an example of negative correlation. 
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Hours of Exercise 
per week 


Months Machine Owned 


These two variables are correlated. More than that, they are correlated in 
a particular direction—negatively. For an example of a positive correla- 
tion, suppose that instead of displaying “hours of exercise” on the vertical 
axis, you put the person’s score on a test that measures cardiovascular fit- 
ness (see Figure 8-2). The pattern of these data points suggests a line that 
slants from lower left to upper right, which is the opposite of the direction 
of slant in the first example. Figure 8-2 shows that the longer the person 
has owned the exercise machine, the better his or her cardiovascular fit- 
ness tends to be. This might be true in spite of the fact that time spent 
exercising decreases the longer the machine has been owned because pur- 
chasers of exercise machines might be starting from a point of low fitness, 
which may improve only gradually. 


Figure 8-2 An example of positive correlation. 


Cardiovascular 
Fitness Score 
e 


Months Machine Owned 


If two variables are positively correlated, as the value of one increases, so 
does the value of the other. If they are negatively (or inversely) correlated, 
as the value of one increases, the value of the other decreases. 


A third possibility remains: that as the value of one variable increases, the 
value of the other neither increases nor decreases. Figure 8-3 is a scatter 
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plot of months exercise machine owned (horizontal axis) by person's height 
(vertical axis). No line trend can be seen in the plot. New owners of exer- 
cise machines may be short or tall, and the same is true of people who have 
had their machines longer. These two variables appear to be uncorrelated. 


Figure 8-3 An example of uncorrelated data. 
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You can go even farther in expressing the relationship between variables. 
Compare the two scatter plots in Figure 8-4. Both plots show a positive 
correlation because, as the values on one axis increase, so do the values on 
the other. But the data points in Figure 8-4 (b) are more closely packed 
than the data points in Figure 8-4 (a), which are more spread out. Ifa line 
were drawn through the middle of the trend, the points in Figure 8-4 (b) 
would be closer to the line than the points in Figure 8-4 (a). In addition 
to direction (positive or negative), correlations can also have strength, 
which is a reflection of the closeness of the data points to a perfect line. 
Figure 8-4 (b) shows a stronger correlation than Figure 8-4 (a). 


Figure 8-4 (a) Weak and (b) strong correlations. 
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Pearson's product moment coefficient (r), commonly referred to as the 
correlation coefficient, is a quantitative measure of correlation between 
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two interval-level variables. The coefficient 7 can take values from —1.0 to 
1.0. The sign of r indicates whether the correlation is positive or negative. 
The magnitude (absolute value) of 7 indicates the strength of the correla- 
tion, or how close the array of data points is to a straight line. 


The computing formula for 7 is 


_ dex (Zx)(Xy) 
ae (nA 1)5 85 


where Xxy is the sum of the xy cross-products (each x multiplied by its 
paired y), mis the size of the sample (the number of data pairs), Xx and Ly 
are the sums of the x and y values, and s, and s, are the sample standard 
deviations of x and y. 


Example 1: Use Table 8-2 to compute + for the relationship between 
months of exercise-machine ownership and hours of exercise per week. 
The first step is to compute the components required in the main formula. 
Let x be months of ownership and y be hours of exercise, although you 
could also do the reverse. 


Table 8-2. Determining the Correlation Coefficient for Table 8-1 


x y xy x V 
5 5 25 25 25 
10 2 20 100 4 
4 8 32 16 64 
8 3 24 64 9 
2 8 16 4 64 
7 5 35 49 25 
9 5 45 81 25 
6 7 42 36 49 
1 10 10 1 100 
12 3 36 144 9 


xx = 64 xy=56 “xy = 285 xXx’ = 520 Ly = 374 


(Xx)? = 64° = 4096 
(Xx)? = 56°= 3136 
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Now, compute the sample standard deviations for x and y using the for- 
mula from Chapter 3: 


(4x) 
oe oar 
x n—-1 

096 
_ 520 “10, 
> 10-1 


Finally, 7 may be computed: 


. _ B94 (E*)(Zy) 

ed (#=1)5,3, 

285 = zh (64) (56) 
(10 — 1) (3.502) (2.591) 


_=73.4 _ 
81.663 


— 0.899 


A correlation of 7 = —.899 is almost as strong as the maximum negative 
correlation of —1.0, reflecting the fact that your data points fall relatively 
close to a straight line. 


Finding the significance of r 
You might want to know how significant an r of —.899 is. The formula to 
test the null hypothesis that R (the population correlation) = 0 is 


r/(n-2) 
Mer) 
where ris the sample correlation coefficient, and 7 is the size of the sam- 


ple (the number of data pairs). The probability of ¢ may be looked up in 
Table 3 (in Appendix B) using 2 — 2 degrees of freedom. 


t= 
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_ =.899 /(10-2) 2,543 _ 
[1 -(-1.899") 338 2:807 


The probability of obtaining a t of —5.807 with 8 df(drop the sign when 
looking up the value of ¢) is lower than the lowest listed probability of 
.0005. If the correlation between months of exercise-machine ownership 
and hours of exercise per week were actually 0, you would expect an 7 of 
—.899 or lower in fewer than one out of a thousand random samples. 


To evaluate a correlation coefficient, first determine its significance. If the 
probability that the coefficient resulted from chance is not acceptably low, 
the analysis should end there; neither the coefficient’s sign nor its magni- 
tude may reflect anything other than sampling error. If the coefficient is 
statistically significant, the sign should give an indication of the direction 
of the relationship, and the magnitude indicates its strength. Remember, 
however, that all statistics become significant with a high enough 7. 


Even if it’s statistically significant, whether a correlation of a given magni- 
tude is substantively or practically significant depends greatly on the phe- 
nomenon being studied. Generally, correlations tend to be higher in the 
physical sciences, where relationships between variables often obey uni- 
form laws, and lower in the social sciences, where relationships may be 
harder to predict. A correlation of .4 between a pair of sociological vari- 
ables may be more meaningful than a correlation of .7 between two vari- 
ables in physics. 


Bear in mind also that the correlation coefficient measures only straight- 
line relationships. Not all relationships between variables trace a straight 
line. Figure 8-5 shows a curvilinear relationship such that values of y 
increase along with values of x up to a point, then decrease with higher 
values of x. The correlation coefficient for this plot is 0, the same as for the 
plot in Figure 8-3. This plot, however, shows a relationship that Figure 8-3 
does not. 


Correlation does not imply causation. The fact that Variable A and Variable 
B are correlated does not necessarily mean that A caused B or that B caused 
A (though either may be true). If you were to examine a database of demo- 
graphic information, for example, you would find that the number of 
churches in a city is correlated with the number of violent crimes in the 
city. The reason is not that church attendance causes crime, but that these 
two variables both increase as a function of a third variable: population. 


102 CliffsQuickReview Statistics 


Figure 8-5 Data that can cause trouble with correlation analysis. 


y 


Also note that the scales used to measure two variables have no effect on 
their correlation. If you had converted hours of exercise per week to min- 
utes per week, and/or months machine owned to days owned, the corre- 
lation coefficient would have been the same. 


The coefficient of determination 

The square of the correlation coefficient ris called the coefficient of deter- 
mination and can be used as a measure of the proportion of variability 
that two variables share, or how much one can be “explained” by the other. 
The coefficient of determination for this example is —0.899° = .808. 
Approximately 80 percent of the variability of each variable is shared with 
the other variable. 


Simple Linear Regression 


Although correlation is not concerned with causation in relationships 
among variables, a related statistical procedure called regression often is. 
Regression is used to assess the contribution of one or more “causing” vari- 
ables (called independent variables) to one “caused” (or dependent) vari- 
able. It can also be used to predict the value of one variable from the values 
of others. When there is only one independent variable and when the rela- 
tionship can be expressed as a straight line, the procedure is called simple 
linear regression. 


Any straight line in two-dimensional space can be represented by this 
equation: 


y= ar bx 


where y is the variable on the vertical axis, x is the variable on the horizon- 
tal axis, a is the y-value where the line crosses the vertical axis (often called 
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the intercept), and 0 is the amount of change in y corresponding to a one- 
unit increase in x (often called the slope). Figure 8-6 gives an example. 


Figure 8-6 A straight line. 


Returning to the exercise example, you observed that the scatter plot of 
points in Figure 8-1 resembles a line. The regression procedure fits the best 
possible straight line to an array of data points. If no single line can be 
drawn such that all the points fall on it, what is the “best” line? The best 
line is the one that minimizes the distance of all the data points to the line, 
a fact that will become clearer after you compute the line for the example. 


Regression is an inferential procedure, meaning that it can be used to draw 
conclusions about populations based on samples randomly drawn from 
those populations. Suppose that your ten exercise-machine owners were 
randomly selected to represent the population of all exercise-machine 
owners. In order to use this sample to make educated guesses about the rela- 
tionship between the two variables (months of machine ownership and time 
spent exercising) in the population, you need to rewrite the equation above 
to reflect the fact that you will be estimating population parameters: 


y= Bo + Bix 


All you have done is replace the intercept (a) with Bp and the slope (6) with 
B;. The formula to compute the parameter estimate ; is 


where Sy= Lxy- 
and B.=9-B.x 
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The formula to compute the parameter estimate By is 
B.=7-B.x 

where 7 and % are the two sample means. 


You have already computed the quantities that you need to substitute into 
these formulas for the exercise example—except for the mean of x(x), which 
is 64/10 = 6.4, and the mean of (7), which is 56/10 = 5.6. First, compute 
the estimate of the slope: 


_ (64)(56) _ 
S.= 285 — 10 =— 73.4 
= enn 4096 
S,.= 520 = (eee 110.4 
6, = —73-4 =- 0.665 


~ 110.4 — 
Now the intercept may be computed: 
Bo=7- B, x 
=5.6 — (—0.665(6.4)) 
=5.6 -(-4.256) 
=9.856 


So the regression equation for the example is y = 9.856 — 0.665(x). When 
you plot this line over the data points, the result looks like that shown in 
Figure 8-7. 


Figure 8-7 _ |llustration of residuals. 
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The vertical distance from each data point to the regression line is the error, 
or residual, of the line’s accuracy in estimating that point. Some points 
have positive residuals (they lie above the line); some have negative ones 
(they lie below it). If all the points fell on the line, there would be no error 
and no residuals. The mean of the sample residuals is always 0 because the 
regression line is always drawn such that half of the error is above it and 
half below. The equations that you used to estimate the intercept and slope 
determine a line of “best fit” by minimizing the sum of the squared resid- 
uals. This method of regression is called least squares. 


Because regression estimates usually contain some error (that is, all points 
do not fall on the line), an error term (€, the Greek letter epsilon) is usu- 
ally added to the end of the equation: 


y=Bot+ Bix+e 


The estimate of the slope B, for the exercise example was -0.665. The slope 
is negative because the line slants down from left to right, as it must for 
two variables that are negatively correlated, reflecting that one variable 
decreases as the other increases. When the correlation is positive, B; is pos- 
itive, and the line slants up from left to right. 


Confidence interval for the slope 

Example 2: What if the slope is 0, as in Figure 8-8? That means that y has 
no linear dependence on x, or that knowing x does not contribute any- 
thing to your ability to predict y. 


Figure 8-8 Example of uncorrelated data, so the slope is zero. 
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It is often useful to compute a confidence interval for a regression slope. 
If it contains 0, you would be unable to conclude that x and yare related. 
The formula to compute a confidence interval for B, is 


(4 6)=Bi+ lon, n-2° Te 


syn% 
where s= eE) 
and Sou pip ee) 


and where X(y—)” is the sum of the squared residuals, %/2,,_ > is the crit- 
ical value from the ¢-table corresponding to half the desired alpha level at 
n— 2 degrees of freedom, and 7 is the size of the sample (the number of 
data pairs). The test for this example will use an alpha of .05. Table 3 (in 
Appendix B) shows that fo5,3 = 2.306. 


Compute the quantity X(y—j)” by subtracting each predicted y-value (j) 
from each actual y-value, squaring it, and summing the squares (see Table 
8-3). The predicted y-value (7) is the y-value that would be predicted from 
each given x, using the formula y = 9.856 — 0.665 (x). 


Table 8-3. Determining the Residuals for the Data in Table 8-1 


x y y residual residual? 
5 5 - 6.530 = -1.530 2.341 
10 2 - 3.205 = -1.205 1.452 
4 8 - 7.195 = 0.805 .648 
8 3 - 4.535 = -1.535 2.356 
2 8 - 8.525 = -0.525 .276 
7 5 - 5.200 = -0.200 .040 
9 5 - 3.870 = 1.130 1.277 
6 7 - 5.865 = 1.135 1.288 
1 10 - 9.190 = 0.810 .656 
12 3 - 1.875 = 1.125 1.266 


0 11.600 
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Now, compute s: 


s= [A900 — /1.45 = 1.204 


You have already determined that S,.= 110.4; you can proceed to the main 
formula: 


1.204 
110.4 


1.204 
10.507 


=— 0.665 + 2.306(0.115) 
=— 0.665 + 0.264 
=(—0.929, — 0.401) 


(a, 6) =— 0.665 + 2.306 


=— 0.665 + 2.306 


You can be 95 percent certain that the population parameter B, (the slope) 
is no lower than —0.929 and no higher than -0.401. Because this interval 
does not contain 0, you would be able to reject the null hypothesis that 
B, = 0 and conclude that these two variables are indeed related in the 
population. 


Confidence interval for prediction 

You have learned that you could predict a y-value from a given x-value. 
Because there is some error associated with your prediction, however, you 
might want to produce a confidence interval rather than a simple point 
estimate. The formula for a prediction interval for y for a given x is 


(ab) =J4 tan, »-2(s) [1+4+ a 
where S,,= x?- een 


and where y is the y-value predicted for x using the regression equation, 
fu? -2 is the critical value from the ¢table corresponding to half the desired 
alpha level at n— 2 degrees of freedom, and 7 is the size of the sample (the 
number of data pairs). 


Example 3: What is a 90 percent confidence interval for the number of 
hours spent exercising per week if the exercise machine is owned 11 
months? 
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The first step is to use the original regression equation to compute a point 
estimate for y: 


y= 9.856 — 0.665 (11) 
=9.856 — 7.315 
=2.541 
For a 90 percent confidence interval, you need to use toss which Table 3 


(in Appendix B) shows to be 1.860. You have already computed the 
remaining quantities, so you can proceed with the formula: 


- fio, (I= 6A)? 
(4 6)=2.541+ 1.860(1.204) 1 + jo + 710.40 — 
21.16 


=2.541 + 1.860(1.204) /1+.10+ 110.40 


=2.541 + 1.860 (1.204) /1.292 
=2.541 + 2.545 
=(—0.004, 5.086) 
You can be 90 percent confident that the population mean for the num- 


ber of hours spent exercising per week when x (number of weeks machine 
owned) = 11 is between about 0 and 5. 


Assumptions and cautions 

The use of regression for parametric inference assumes that the errors (€) 
are (1) independent of each other and (2) normally distributed with the 
same variance for each level of the independent variable. Figure 8-9 shows 
a violation of the second assumption. The errors (residuals) are greater for 
higher values of x than for lower values. 


Figure 8-9 Data with increasing variance as x increases. 


y ee 
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Least squares regression is sensitive to outliers, or data points that fall far 
from most other points. If you were to add the single data point x = 15, 
y= 12 to the exercise data, the regression line would change to the dotted 
line shown in Figure 8-10. You need to be wary of outliers because they 
can influence the regression equation greatly. 


Figure 8-10 Least squares regression is sensitive to outliers. 
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It can be dangerous to extrapolate in regression—to predict values beyond 
the range of your data set. The regression model assumes that the straight 
line extends to infinity in both directions, which is often not true. According 
to the regression equation for the example, people who have owned their 
exercise machines longer than around 15 months do not exercise at all. It 
is more likely, however, that “hours of exercise” reaches some minimum 
threshold and then declines only gradually, if at all (see Figure 8-11). 


Figure 8-11 Extrapolation beyond the data is dangerous. 
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Chi-Square (y’) 


The statistical procedures that you have reviewed thus far are appropriate 
only for variables at the interval and ratio levels of measurement. The chi- 
square (Y”) test can be used to evaluate a relationship between two nom- 
inal or ordinal variables. It is one example of a non-parametric test. 
Non-parametric tests are used when assumptions about normal distribu- 
tion in the population cannot be met or when the level of measurement 
is ordinal or less. These tests are less powerful than parametric tests. 


Suppose that 125 children are shown three television commercials for 
breakfast cereal and are asked to pick which they liked best. The results are 
shown in Table 8-4. 


You would like to know if the choice of favorite commercial was related 
to whether the child was a boy or a girl or if these two variables are inde- 
pendent. The totals in the margins will allow you to determine the over- 
all probability of (1) liking commercial A, B, or C, regardless of gender, 
and (2) being either a boy or a girl, regardless of favorite commercial. If 
the two variables are independent, then you should be able to use these 
probabilities to predict approximately how many children should be in 
each cell. If the actual count is very different from the count that you would 
expect if the probabilities are independent, the two variables must be 
related. 


Table 8-4 Commercial Preference for Boys and Girls 


A B Cc Totals 
Boys 30 29 16 75 
Girls 12 33 5 50 
Totals 42 62 21 125 


Consider the first (upper left) cell of the table. The overall probability of a 
child in the sample being a boy is 75 + 125 = .6. The overall probability of 
liking Commercial A is 42+ 125 = .336. The multiplication rule (covered 
in Chapter 4) states that the probability of both of two independent events 
occurring is the product of their two probabilities. Therefore, the probabil- 
ity of a child both being a boy and liking Commercial A is .6 X .336 = .202. 
The expected number of children in this cell, then, is .202 x 125 = 25.2. 
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There is a faster way of computing the expected count for each cell: 
Multiply the row total by the column total and divide by 7. The expected 
count for the first cell is, therefore, (75 x 42) + 125 = 25.2. If you perform 
this operation for each cell, you get the expected counts (in parentheses) 
shown in Table 8-5. 


Table 8-5 Chi-Square Results for Table 8-4 


A B Cc Totals 
Boys 30 (25.2) 29 (37.2) 16 (12.6) 75 
Girls 12 (16.8) 33 (24.8) 5 (8.4) 50 
Totals 42 62 21 125 


Note that the expected counts properly add up to the row and column 
totals. You are now ready for the formula for x”, which compares each 
cell’s actual count to its expected count: 


( observed— expected ) : 


2 


X= 


expected 


The formula describes an operation that is performed on each cell and 
which yields a number. When all the numbers are summed, the result is 
x”. Now, compute it for the six cells in the example: 


20425,2)" g (20 37.2)? (16-12.6) 
25.2 Bias Ye 


(12-16.8)* (33-24.8)* (5-84)? 
i6gc 048 1 Be 
= 914 + 1.808 +917 + 1.371 #2.711 +1.376 


=9.097 


The larger x”, the more likely that the variables are related; note that the 
cells that contribute the most to the resulting statistic are those in which 
the expected count is very different from the actual count. 


Chi-square has a probability distribution, the critical values for which are 
listed in Table 4 in Appendix B. As with the distribution, x” has a degrees- 
of-freedom parameter, the formula for which is 


(#rows — 1) X (¢columns — 1) 
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or in your example: 
(2-l)x@-1)=1x2=2 


In Table 4 in Appendix B, a chi-square of 9.097 with two degrees of free- 
dom falls between the commonly used significance levels of .05 and .01. 
If you had specified an alpha of .05 for the test, you could, therefore, reject 
the null hypothesis that gender and favorite commercial are independent. 
At a= .01, however, you could not reject the null hypothesis. 


The x’ test does not allow you to conclude anything more specific than 
that there is some relationship in your sample between gender and com- 
mercial liked (at & = .05). Examining the observed vs. expected counts in 
each cell might give you a clue as to the nature of the relationship and 
which levels of the variables are involved. For example, Commercial B 
appears to have been liked more by girls than boys. But y’ tests only the 
very general null hypothesis that the two variables are independent. 


Chapter Checkout 


Q&A 
For the problems 1-3, consider this data on the dose of Medicine X admin- 
istered and the length of time of continued illness. 
Dose 0 1 2 3 4 5 6 
Time to cure (days) 15.4 12.2 13.7. 92 9.9 61 4.1 
1. a. Make a scatter plot of the data. Is this negative or positive 
correlation? 
b. Calculate the correlation coefficient. 
c. Find the significance of the correlation coefficient. 
2.a. Apply linear regression to the data to find the slope and 
intercept. 


b. Calculate the 95 percent confidence interval for the slope of your 
regression. 
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3. a. Predict the result if the dose was 1.5, and give the 90 percent 
confidence interval for your prediction. 
b. Predict the result if the dose was 8, and give the 90 percent con- 
fidence interval. 


Answers la. negative b. —.9511 c. t= —6.885, p< .0005 2a. B, = —1.78; 
Bo= 15.42 b. -1.78 + 0.66 3a. 12.76 + 3.05 b. 1.19 + 3.94 


Critical Thinking 


1. Imagine that the experiment in problems 1-3 is carried further, and 
the results are as follows: 
Dose fi 8 9 10 11 


Time to cure (days) 4.0 3.9 4.2 3.8 4.5 


Add this data to your scatter plot and comment on what is happen- 
ing. What about your prediction in Problem 3b.? 


Appendix A 
COMMON MISTAKES 


I: may seem that there are many ways to make errors in working a statis- 
tics problem. In fact, most errors on statistics exams can be reduced to 
a short list of common oversights. If you learn to avoid the mistakes listed 
here, you can greatly reduce your chances of making an error on an exam. 


M@ Forgetting to convert between standard deviation (6 and s) and 
variance (6 and s*). Some formulas use one; some use the other. 
Square the standard deviation to get the variance, or take the positive 
square root of the variance to get the standard deviation. 


M Misstating one-tailed and two-tailed hypotheses. If the hypothesis 
predicts simply that one value will be higher than another, it requires 
a one-tailed test. If however, it predicts that two values will be differ- 
ent—that is, one value will be either higher or lower than another or 
that they will be equal—then use a two-tailed test. Make sure your 
null and alternative hypotheses together cover all possibilities—greater 
than, less than, and equal to. 


m@ Failing to split the alpha level for two-tailed tests. If the overall 
significance level for the test is .05, then you must look up the criti- 
cal (tabled) value for a probability of .025. The alpha level is always 


split when computing confidence intervals. 


™ Misreading the standard normal (2) table. As explained in Chapter 
5, all standard normal tables do not have the same format, and it is 
important to know what area of the curve (or probability) the table 
presents as corresponding to a given z-score. Table 2 in Appendix B 
gives the area of the curve lying at or below z. The area to the right 
of z (or the probability of obtaining a value above z) is simply 1 minus 


the tabled probability. 
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Using 7 instead of — 1 degrees of freedom in one-sample ¢tests. 
Remember that you must subtract 1 from 7 in order to get the 
degrees-of-freedom parameter that you need in order to look up a 
value in the table. 


Confusing confidence level with confidence interval. The confi- 
dence level is the significance level of the test or the likelihood of 
obtaining a given result by chance. The confidence interval is a range 
of values between the lowest and highest values that the estimated 
parameter could take at a given confidence level. 


Confusing interval width with margin of error. A confidence inter- 
val is always a point estimate plus or minus a margin of error. The 
interval width is double that margin of error. If, for example, a pop- 
ulation parameter is estimated to be 46 percent plus or minus 4 per- 
cent, the interval width is 8 percent. 


Confusing statistics with parameters. Parameters are characteristics 
of the population that you usually do not know; they are designated 
with Greek symbols (u and ©). Statistics are characteristics of samples 
that you are usually able to compute. Although statistics correspond 
to parameters (x is the mean of a sample, as Ul is the mean of a popu- 
lation), the two are not interchangeable; hence, you need to be care- 
ful and know which variables are parameters and which are statistics. 
You compute statistics in order to estimate parameters. 


Confusing the addition rule with the multiplication rule. When 
determining probability, the multiplication rule applies if all favor- 
able outcomes must occur in a series of events. The addition rule 
applies when at least one success must occur in a series of events. 


Forgetting that the addition rule applies to mutually exclusive 
outcomes. If outcomes can occur together, the probability of their 
joint occurrence must be subtracted from the total “addition-rule” 
probability. (See Chapter 4 for more information.) 


Forgetting to average the two middle values of an even-numbered 
set when figuring median and (sometimes) quartiles. If an ordered 
series contains an even number of measures, the median is always the 
mean of the two middle measures. 


116 CliffsQuickReview Statistics 


M™ Forgetting where to place the points of a frequency polygon. The 
points of a frequency polygon are always at the center of each of 
the class intervals, not at the ends. 


\\ 


Appendix B 
TABLES 


This appendix includes the following tables: 
Table 1. Binomial Probabilities, P(x) for n < 20 
Table 2. Standard Normal Probabilities 

Table 3. ¢ Distribution Critical Values 

Table 4. 7 Critical Values 
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Table 2. Standard Normal Probabilities 


(Table entry is probability at or below z.) 


Probability 
Zz 
z .00 -O1 02 -03 -O4 
-3.4 .0003 .0003 .0003 .0003 .0003 
- 3.3 .0005 .0005 .0005 .0004 .0004 
- 3.2 .0007 .0007 .0006 .0006 .0006 
- 3.1 .0010 .0009 .0009 .0009 .0008 
- 3.0 .0013 .0013 .0013 .0012 .0012 
-29 .0019 .0018 .0018 .0017 .0016 
-28 .0026 .0025 .0024 .0023 .0023 
-27 .0035 .0034 .0033 .0032 .0031 
-26 .0047 .0045 .0044 .0043 .0041 
-2.5 .0062 .0060 .0059 .0057 .0055 
-24 .0082 .0080 .0078 .0075 .0073 
-23 .0107 .0104 .0102 .0099 .0096 
-2.2 .0139 .0136 .0132 .0129 .0125 
-2.1 .0179 .0174 .0170 .0166 .0162 
- 2.0 .0228 .0222 .0217 .0212 .0207 
-19 .0287 .0281 .0274 .0268 .0262 
-18 .0359 .0351 0344 .0336 .0329 
-1.7 0446 .0436 .0427 .0418 .0409 
- 1.6 .0548 .0537 .0526 .0516 .0505 
- 1.5 .0668 .0655 .0643 .0630 .0618 
-14 .0808 .0793 .0778 .0764 .0749 
-1.3 .0968 .0951 .0934 .0918 .0901 
- 1.2 .1151 1131 -1112 -1093 -1075 
- 1.1 .1357 -1335 1314 -1292 .1271 
- 1.0 .1587 -1562 -1539 -1515 -1492 
- 0.9 1841 1814 1788 .1762 1736 
- 0.8 .2119 .2090 .2061 .2033 .2005 
- 0.7 .2420 .2389 .2358 .2327 .2296 
- 0.6 2743 .2709 .2676 .2643 .2611 
- 0.5 3085 3050 3015 .2981 .2946 
- 0.4 3446 3409 3372 3336 3300 
- 0.3 3821 3783 3745 3707 3669 
- 0.2 4207 4168 4129 4090 4052 
- 0.1 4602 4562 4522 4483 4443 


- 0.0 .59000 4960 4920 4880 4840 
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.05 -06 .07 08 -09 4 
.0003 .0003 .0003 .0003 .0002 -3.4 
.0004 .0004 .0004 .0004 .0003 - 3.3 
.0006 .0006 .0005 .0005 .0005 - 3.2 
.0008 .0008 .0008 .0007 .0007 -3.1 
0011 0011 0011 .0010 .0010 - 3.0 
.0016 0015 0015 .0014 0014 -29 
.0022 .0021 .0021 .0020 .0019 - 28 
.0030 .0029 .0028 .0027 .0026 -27 
.0040 .0039 .0038 .0037 .0036 - 26 
.0054 .0052 .0051 .0049 .0048 -2.5 
.0071 .0069 .0068 .0066 .0064 -24 
.0094 .0091 .0089 .0087 0084 - 23 
.0122 .0119 .0116 .0113 .0110 -2.2 
.0158 0154 .0150 .0146 .0143 - 2.1 
.0202 .0197 .0192 .0188 .0183 - 2.0 
.0256 .0250 0244 .0239 .0233 -19 
.0322 0314 .0307 .0301 .0294 - 18 
.0401 0392 0384 .0375 .0367 -17 
.0495 0485 .0475 .0465 .0455 - 1.6 
.0606 0594 .0582 0571 .0559 -1.5 
.0735 0721 .0708 .0694 .0681 -14 
.0885 .0869 .0853 .0838 0823 - 1.3 
.1056 1038 .1020 .1003 .0985 - 1.2 
1251 1230 1210 .1190 .1170 -1.1 
.1469 1446 1423 1401 1379 - 1.0 
1711 1685 .1660 1635 1611 - 0.9 
1977 .1949 1922 .1894 .1867 - 0.8 
.2266 .2236 .2206 2177 .2148 - 0.7 
.2578 .2546 2514 .2483 2451 - 0.6 
.2912 .2877 .2843 .2810 .2776 —0.5 
3264 3228 3192 3156 3121 - 0.4 
3632 3594 3557 3520 3483 - 0.3 
4013 3974 3936 3897 3859 - 0.2 
4404 A364 4325 4286 4247 - 0.1 


4801 4761 4721 4681 4641 - 0.0 
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Probability 
4 .00 -O1 02 -03 04 
0.0 .5000 .5040 5080 .5120 .5160 
0.1 .5398 5438 5478 -5517 .5557 
0.2 .5793 5832 .5871 .5910 5948 
0.3 .6179 6217 .6255 .6293 .6331 
0.4 .6554 .6591 .6628 .6664 .6700 
0.5 .6915 .6950 .6985 7019 7054 
0.6 7257 7291 7324 7357 7389 
0.7 .7580 7611 7642 7673 7704 
0.8 .7881 7910 7939 .7967 7995 
0.9 8159 8186 8212 8238 8264 
1.0 8413 8438 8461 8485 .8508 
1.1 8643 .8665 .8686 .8708 .8729 
1.2 8849 .8869 .8888 .8907 .8925 
1.3 .9032 9049 9066 9082 .9099 
1.4 9192 9207 9222 9236 9251 
1.5 9332 9345 9357 .9370 9382 
1.6 9452 9463 9474 9484 9495 
1.7 9554 9564 9573 9582 .9591 
1.8 9641 9649 .9656 9664 .9671 
1.9 9713 .9719 9726 9732 .9738 
2.0 9772 .9778 9783 9788 9793 
2.1 .9821 9826 .9830 9834 .9838 
2.2 .9861 9864 9868 9871 9875 
2.3 .9893 .9896 .9898 9901 9904 
2.4 .9918 .9920 9922 9925 9927 
2.5 .9938 9940 9941 9943 9945 
2.6 9953 9955 .9956 9957 .9959 
2.7 .9965 .9966 .9967 .9968 .9969 
2.8 9974 9975 .9976 9977 9977 
2.9 .9981 9982 9982 9983 9984 
3.0 .9987 9987 9987 .9988 .9988 
3.1 .9990 9991 9991 9991 9992 
3.2 .9993 9993 9994 9994 9994 
3.3 .9995 9995 9995 .9996 .9996 
3.4 .9997 .9997 9997 9997 9997 
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.05 -06 .07 08 -09 ¥4 
.5199 0239 .5279 5319 5359 0.0 
.0596 .0636 .5675 0714 5753 0.1 
5987 .6026 .6064 .6103 6141 0.2 
.6368 .6406 6443 .6480 .6517 0.3 
.6736 6772 .6808 .6844 .6879 0.4 
.7088 7123 7157 .7190 1224 0.5 
7422 7454 7486 7517 7549 0.6 
7734 .7764 7794 7823 7852 0.7 
8023 8051 .8078 .8106 8133 0.8 
8289 8315 .8340 8365 .8389 0.9 
8531 8554 8577 .8599 8621 1.0 
8749 8770 .8790 .8810 .8830 1.1 
8944 8962 .8980 .8997 .9015 1.2 
9115 9131 9147 .9162 .9177 1.3 
9265 .9279 .9292 .9306 .9319 1.4 
.9394 .9406 .9418 9429 9441 1.5 
.9505 9515 9525 9535 9545 1.6 
.9599 .9608 .9616 .9625 .9633 1.7 
.9678 .9686 .9693 .9699 .9706 1.8 
9744 .9750 .9756 .9761 .9767 1.9 
.9798 .9803 .9808 9812 .9817 2.0 
9842 .9846 .9850 9854 .9857 2.1 
.9878 .9881 .9884 .9887 .9890 2.2 
.9906 .9909 9911 .9913 .9916 2.3 
.9929 .9931 .9932 9934 .9936 2.4 
.9946 .9948 .9949 .9951 .9952 2.5 
.9960 .9961 .9962 .9963 .9964 2.6 
.9970 .9971 9972 .9973 .9974 2.7 
.9978 .9979 .9979 .9980 .9981 2.8 
.9984 9985 .9985 .9986 .9986 2.9 
.9989 .9989 .9989 .9990 .9990 3.0 
.9992 .9992 .9992 .9993 .9993 3.1 
.9994 9994 .9995 .9995 .9995 3.2 
.9996 .9996 .9996 .9996 .9997 3.3 


.9997 .9997 .9997 .9997 .9998 3.4 
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CQR REVIEW 


Use this CQR Review to practice what you've learned in this book. After 
you work through the review questions, youre well on your way to achiev- 
ing your goal of understanding basic statistical methods. 


Chapter 2 


1. What is wrong with this pie chart? 


Poor 
a Below 
6% 

Average 


13% 


Excellent 
25% 


Average 
45% 


2. Study the following frequency histogram of the inventory at a hat 
store. The total number of hats is 1,000. Then answer these questions: 


a. How many hats are blue? 

b. How many hats are white or black? 

c. Ifa hat is chosen randomly, what is the chance that it is multi- 
colored? 
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Frequency 
Oo 
ie) 
o 


0.00 T T T T T T T T r 1 1 
white blue red black brown multi 


Hat Color 


3. True or False: “Median” and “mean” are two terms for the same 
quantity. 

4. True or False: “Variance” and “standard deviation” are two terms for 
the same quantity. 


Chapter 4 


5. True or False: The binomial distribution applies only to events that 
have two outcomes: “success” or “failure.” 


6. A fair die, with six sides, is tossed twice. The chance of rolling two 


sixes is 
a. 2/6 
b. 1/6 
ec. 1/12 
d. 1/36 
e. 1/4 
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7. Two fair dice, with six sides each, are rolled. The probability that the 
sum of the dice equals 12 is 


a. 0 

b. 1 

c. 1/6 
d. 1/12 
e. 1/36 


8. Two fair dice, with six sides each, are rolled. The probability that the 
sum of the dice equals 7 is 


a. 0 

b. 1 

c. 1/6 
d. 1/12 
e. 1/36 


9. A population has a mean of 1 and a variance of 4. The best you can 
say about the probability of randomly choosing a value greater than 
7 from this sample is that it is 


0 

less than 0.15 percent 
less than 0.3 percent 
less than 32 percent 
e. less than 16 percent 


Bos fp 


10. A slot machine has four windows, and each can contain one of eight 
different symbols. To win the jackpot, all four windows must con- 
tain the same symbol. The chance of a jackpot is 


0 
1/8 
(1/8)? 
1/4 

1 


pao 


Chapter 5 


11. True or False: In a normally distributed population, the standard devi- 
ation may never be larger than the mean. 


12. True or False: For all practical purposes, and % are the same thing. 
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13. You think that the mean of your population falls in the range between 
—1 and 1 and want to test this idea. Your null hypothesis is Ho: 


Bop 


e. 


ae) Re a | 
-l<usl 
uz 0 

uz I,u# —l 
-12%21 


14. Which normal curve pictured below has the greatest standard 
deviation? 


1.55 


0.55 


Chapter 6 


15. True or False: If two variables are correlated, then they must be 
causally related. 


16. Rejecting the null hypothesis when it is actually true is 


paore 


no error 

a Type I error 

a Type II error 

neither a Type I nor a Type II error 
impossible 
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Chapter 7 


17. A machine for filling beer bottles is designed to put 12 ounces of beer 
in each bottle, with a standard deviation of 0.05 ounce. How sure 
can you be that your beer contains at least 11.9 ounces? 


18. Suppose that there are only two candidates (A and B) for president 
and that in a particular state 5,000,000 votes will be cast. 


a. What percentage of the total votes should a TV station count, 
to be sure to within +1 percent that candidate A has won, at a 
95 percent confidence level? 

b. The first 20,000 votes are rapidly counted. Candidate A has 
received 52 percent and candidate B has received 48 percent of 
the vote. What is the width of the 99.9 percent confidence inter- 
val on candidate A’s result, as a percentage of the whole 5,000,000 
votes? 

c. Comment on the methods of projection in part a. Is it a good 
idea to use the first votes cast to project the winner? 


19. At a major airport, a researcher studies the average length of delays 
(excluding flights that leave on time). His data is shown below, for 
ten randomly chosen days. 


Day 1 2 3 4 5 6 7 8 9 10 


Average delay 2.0 1.8 0.32 041 2.3 0.72 0.22 0.28 0.57 3.2 
(hours) 


a. Calculate the mean and standard deviation of the delays, as well 
as the median. 

b. What is the 95 percent confidence interval on the average length 
of delay? 

c. Compare the mean and median from part a. What is the p value 
for a delay of the median size? 

d. Do you think this population is normally distributed? 


Chapter 8 


20. True or False: If the correlation coefficient for a set of data is zero, 
then there is no relationship between the dependent and indepen- 
dent variables. 


21. True or False: The x’ test only tells you if two variables are 
independent. 
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22. A microbiologist is testing a new antibiotic she developed against a 
common one. She applies each drug to different plates of bacterial 
cultures and measures the fraction of bacteria that dies after one day. 
Her results are given in the following table. 


Dose of Drug (mg) 0 Os MeO bBo iO + SP ae 


New drug 2.0 12.3 27.1 32.2 48.9 56.6 60.6 
(% dead) 
Old drug 3.1 60 16.5 12.3 25.9 29.0 42.5 
(% dead) 


Using the information in the preceding table, answer these questions: 


a. Plot the results for each drug and calculate the coefficient of 
correlation for each drug. 

b. What is the significance level of each of these correlations? 

c. Calculate the slope for each plot. 

d. Do the 95 percent confidence intervals for the two slopes overlap? 


23. Consider the following data on the color of a dog and the average 
annual income of the dog’s owner. Compute a X” value for this data 
and determine the p-level to which these variables are related. 


$0-$25,000 $25,001- 50,000 $50,001-75,000 $75,000+ Totals 


White 10 12 17 23 62 
Black 15 28 13 11 67 
Orange/golden 7 9 11 15 42 
Mixed 24 26 32 42 124 
Total 56 75 73 91 295 


Answers: 1. More than 100% is shown. 2a. 75 b. 750 c. 0.025 3. False 
4, False 5. True 6. d 7. ec 8.c 9. b 10. c 11. False 12. False 13. a 14. c 
15. False 16. b 17. 97.72% 18a. 0.192% b. 2.2% 19a. 1.18; 1.06; .65 
b. (0.42,1.94) « .05<p<.1ld.no 20. False 21. True 22a. .995; .949 
b. p < .0005; .0005 < p < .001 c. 20.45; 12.28 d. no 23. x = 17.04; 
025 <p<.05. 


CQR RESOURCE CENTER 


CQR Resource Center offers the best resources available in print and online 
to help you study and review the core concepts of statistics and statistical 
methods. You can find additional resources, plus study tips and tools to 
help test your knowledge, at www.cliffsnotes.com. 


Books 


There are many books available on basic statistical methods. Many use 
humor to get across the concepts. Although most of these concepts were 
covered in CQR Statistics, if youre still struggling, one of these may help. 


Excel 2000 For Windows For Dummies, by Greg Harvey. Once you've 
got your data, how can you analyze it efficiently? After all, if you've 
got more than ten samples, you don’t want to be calculating by hand. 
You need a spreadsheet, and Excel performs good statistical analysis, 
including all of the basic techniques from CQR Statistics. This book 
shows you how to use Excel effectively and how to perform the 
statistical tests you are interested in. Hungry Minds, $21.99. 


The Cartoon Guide to Statistics, by Larry Gonick and Woollcott Smith. 
This book offers much of the same information as CQR Statistics— 
but with cartoons, too! It may sound silly, but it can really help to 
illustrate the concepts. Harper Collins, $16.00. 


Chances Are: The Only Statistics Book You'll Ever Need, by Steve 
Slavin. This book is designed to make statistics accessible to people 
with little mathematical background. If your math skills are holding 
you up, check this book out. Madison Books, $16.95. 


How to Lie with Statistics, by Darrell Huff. This is a classic, and any list 
of statistics book would be incomplete without it. Huff shows how 
badly analyzed and interpreted data can be used to sell products or 
push agendas, providing good examples of the misuses and abuses of 
statistics. Worth a look, to keep from repeating the errors he points 
out. W.W. Norton & Co., $9.25. 
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Hungry Minds also has three Web sites that you can visit to read about all 
the books we publish: 


BM www.cliffsnotes.com 
HB www.dummies.com 
BM www.hungryminds.com 


Internet 
Visit the following Web sites for more information about statistics: 


American Statistical Association—www.amstat .org—This site is really 
more interesting than you might think. There is a nice discussion of 
“What statisticians do,” career opportunities in statistics, and also 
some educational resources. The Association offers online courses and 
maintains a list of other Web sites related to statistics (click on Center 
for Statistics Education or Links & Resources). 


Portraits of Statisticians—www.york.ac.uk/depts/maths/histstat/ 
people/—If you want to see where all this statistics stuff came from, 
well, here are pictures or portraits of famous statisticians, past and 
present. An interesting site put together by the mathematics 
department at the University of York in the United Kingdom. 


Concepts and Applications of Inferential Statistics—nht tp: //faculty. 
vassar.edu/~lowry/webtext.html—This site is a complete 
statistics textbook! If you want to review the concepts in the CQR 
Statistics book or move beyond into some more advanced concepts, 
this is worth a look. The explanations are clear and examples abound. 
The site is maintained by Professor Richard Lowry at Vassar College. 


Statistical Methods  (Index)—http://bubl.ac.uk/link/s/ 
statisticalmethods.htm—This is actually a page from the BUBL 
Catalog on Internet Resources, a larger site maintained by the Centre 
for Digital Library Research at Strathclyde University in the United 
Kingdom. On this page, you'll find many links to pages dedicated to 
statistical methods. The BUBL page is well organized, with a 
description of the page that each link leads to. If the Web pages 
mentioned in CQR Statistics material don't have what you're looking 
for, try one of the links on this page. 


World Lecture Hall: Statistics—www.utexas.edu/world/lecture/ 
statistics/—From the University of Texas, this site contains links 
to statistics course Web pages from universities around the world. 
Some of the links are non-functional, but most lead to a page with 
lecture notes, exercises, and other statistics material. 
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Note: The Internet is loaded with statistical information. Most it, how- 
ever, is descriptive statistics. These pages include all sorts of government 
statistics—from the U.S. Bureau of Transportation (www.bts.gov), to the 
Bureau of Justice (www.ojp.usdoj.gov/bjs/). There is a vast amount 
of information, to which you could apply the statistical methods you 


learned with this book. 


Next time you're on the Internet, don’t forget to drop by www. 
cliffsnotes.com. We created an online Resource Center that you can 
use today, tomorrow, and beyond. 


Glossary 


addition rule for mutually exclusive 
random events, the chance of at least 
one of them occurring is the sum of 
their individual probabilities. 


alternative hypothesis a research 
hypothesis; the hypothesis that is sup- 
ported if the null hypothesis is 
rejected. 


bar chart a graphic that displays how 
data fall into different categories or 
groups. 


bell-shaped. curve symmetrical, single- 
peaked frequency distribution. Also 
called the normal curve or gaussian 
curve. 


bias the consistent underestimation or 
overestimation of a true value, because 
of preconceived notion of the person 
sampling the population. 


bimodal curve with two equal scores 
of highest frequency. 


binomial event with only two possible 
outcomes. 


binomial probability distribution 
for binomial events, the frequency of 
the number of favorable outcomes. For 
a large number of trials, the binomial 
distribution approaches the normal 
distribution. 


bivariate involving two variables, 
especially, when attempting to show a 
correlation between two variables, the 
analysis is said to be bivariate. 


box plot (box-and-whiskers) a 
graphic display of data indicating sym- 
metry and central tendency. 


Central Limit Theorem a rule that 
states that the sampling distribution of 
means from any population will be 
normal for large sample 7. 


chi-square a probability distribution 
used to test the independence of two 
nominal variables. 


class frequency the number of obser- 
vations that fall into each class interval. 


class intervals categories or groups 
contained in frequency graphics. 


coefficient of determination a mea- 
sure of the proportion of each other’s 
variability that two variables share. 


confidence interval the range of val- 
ues that a population parameter could 
take at a given level of significance. 


confidence level the probability of 
obtaining a given result by chance. 


continuous variable a variable that 
can be measured with whole numbers 
and fractional (or decimal) parts 
thereof. 


correlated two (or more) quantities 
that change together in a consistent 
manner. Thus, if the value of one 
variable is known, the other can be 
immediately determined from their 
relationship. 


correlation coefficient a measure of 
the degree to which two variables are 
linearly related. 


critical value the value of a computed 
statistic used as a threshold to decide 
whether the null hypothesis will be 
rejected. 
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data numerical information about 
variables; the measurements or obser- 
vations to be analyzed with statistical 
methods. 


degrees of freedom a parameter used 
to help select the critical value in some 
probability distributions. 


dependent events events such that the 
outcome of one has an effect on the 
probability of the outcome of the 
other. 


dependent variable a variable that is 
caused or influenced by another. 


descriptive statistics numerical data 
that describe phenomena. 


deviation the distance of a value in a 
population (or sample) from the mean 
value of the population (or sample). 


directional test a test of the prediction 
that one value is higher than another; 
also called a one-tailed test. 


discrete variable a variable that can be 
measured only by means of whole 
numbers; or one which assumes only a 
certain set of definite values, and no 
others. 


disjoint occurrence both outcomes 
unable to happen at the same time. 


distribution a collection of measure- 
ments; how scores tend to be dispersed 
about a measurement scale. 


dot plot a graphic that displays the 
variability in a small set of measures. 


double counting a mistake encoun- 
tered in calculating the probability of 
at least one of several events occurring, 
when the events are not mutually 
exclusive. In this case, the addition 
rule does not apply. 


empirical rule a rule that is founded 
on observation, without a theoretical 
basis. Or a “rule of thumb.” 


frequency distribution the frequency 
of occurrence of the values of a vari- 
able. For each possible value of the 
variable, there is an associated fre- 
quency with which the variable 
assumes that value. 


frequency histogram a graphic that 
displays how many measures fall into 
different classes, giving the frequency 
at which each category is seen 
observed. 


frequency polygon a graphic presen- 
tation of frequency of a phenomenon 
that typically uses straight lines and 
points. 


grouped data data that has been 
sorted into categories, usually in order 
to construct a frequency histogram. 


grouped measures a set of values that 
belong to the same class. 


histogram a graphic presentation of 
frequency of a phenomenon. 


independent events events such that 
the outcome of one has no effect on 
the probability of the outcome of the 
other. 


independent variable a variable that 
causes, or influences, another variable. 


inference conclusion about a popula- 
tion parameter based upon analysis of 
a sample statistic. Inferences are always 
stated with a confidence level. 


intercept the value of y at which a line 
crosses the vertical axis. 


interquartile range set of measures 
lying between the lower quartile (25th 
percentile) and the upper quartile 
(75th percentile), inclusive. 


interval a scale using numbers to rank 
order; its intervals are equal but with 
an arbitrary 0 point. 


joint occurrence both outcomes hap- 
pening simultaneously; P(AB). 


least squares any line- or curve-fitting 
model that minimizes the squared dis- 
tance of data points to the line. 


lower quartile (Q,), the 25th per- 
centile of a set of measures. 


mean the sum of the measures in a 
distribution divided by the number of 
measures; the average. 


measures of central tendency de- 
scriptive measures that indicate the 
center of a set of values, for example, 
mean, median, and mode. 


measures of variation descriptive 
measures that indicate the dispersion 
of a set of values, for example, vari- 
ance, standard deviation, and standard 
error of the mean. 


median the middle measure in an 
ordered distribution. 


middle quartile (Q,), the 50th per- 
centile of a set of measures; the 
median. 


mode most frequent measure in a dis- 
tribution; the high point on a fre- 
quency distribution. 


mound-shaped curve see bell-shaped 
curve. 
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multiplication rule the probability of 
two or more independent (hence, not 
mutually exclusive) events all occur- 
ring is the product of their individual 
probabilities. 


mutually exclusive events such that 
the occurrence of one precludes the 
occurrence of the other. 


negative relationship a relationship 
between two variables such that when 
one increases, the other decreases. 


negatively skewed curve a probability 
or frequency distribution that is not 
normal, but rather is shifted such that 
the mean is less than the mode. 


nominal a scale using numbers, sym- 
bols, or names to designate different 
subclasses. 


non-directional test a test of the pre- 
diction that two values are equal or a 
test that they are not equal; a two- 
tailed test. 


non-parametric test statistical test 
used when assumptions about normal 
distribution in the population cannot 
be met, or when the level of measure- 
ment is ordinal or less. For example, 
the y-square test. 


normal distribution smooth bell- 
shaped curve symmetrical about the 
mean such that its shape and area obey 
the empirical rule. 


null hypothesis the reverse of the 
research hypothesis. The null hypothe- 
sis is directly tested by statistical analy- 
sis so that it is either rejected or not 
rejected, with a confidence level. If the 
null hypothesis is rejected, the alterna- 
tive hypothesis is supported. 
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numerical statistics statistical parame- 
ters presented as numbers (as opposed 
to pictorial statistics). 


ogive a graphic that displays a running 
total. 


one-tailed test a test of the prediction 
that one value is higher than another. 


ordinal a scale using numbers or sym- 
bols to rank order; its intervals are 
unspecified. 


outlier a data point that falls far from 
most other points; a score extremely 
divergent from the other measures of 
a set. 


parameter a characteristic of a popula- 
tion. The goal of statistical analysis is 
usually to estimate population parame- 
ters, using statistics from a sample of 
the population. 


Pearson’s product moment coefficient 
identical to the correlation coefficient. 


percentile the value in an ordered set 
of measurements such that P% of the 
measures lie below that value. 


pictorial statistics statistical parameters 
that are presented as graphs or charts 
(as opposed to simply as numbers). 


pie chart a graphic that displays parts 
of the whole, in the form of a circle 
with its area divided appropriately. 


point estimate a number computed 
from a sample to represent a popula- 
tion parameter. 


population a group of phenomena 
that have something in common. The 
population is the larger group, whose 
properties (parameters) are estimated 


by taking a smaller sample from 
within the population, and applying 
statistical analysis to the sample. 


positive relationship a relationship 
between two variables such that when 
one increases, the other increases, or 
when one decreases, the other 
decreases. 


positively skewed curve a probability 
or frequency distribution that is not 
normal, but rather is shifted such that 
the mean is greater than the mode. 


power the probability that a test will 
reject the null hypothesis when it is, in 


fact, false. 


probability a quantitative measure of 
the chances for a particular outcome 
or outcomes. 


probability distribution a smooth 
curve indicating the frequency distribu- 
tion for a continuous random variable. 


proportion for a binomial random 
event, the probability of a successful 
(or favorable) outcome in a single trial. 


qualitative variable phenomenon 
measured in kind, that is, non- 
numerical units. For example, color is 
a qualitative variable, because it cannot 
be expressed simply as a number. 


quantitative variable phenomenon 
measured in amounts, that is, numeri- 
cal units. For example, length is a 
quantitative variable. 


random an event for which there is no 
way to know, before it occurs, what 
the outcome will be. Instead, only the 
probabilities of each possible outcome 
can be stated. 


random error error that occurs as a 
result of sampling variability, through 
no direct fault of the sampler. It is a 
reflection of the fact that the sample is 
smaller than the population; for larger 
samples, the random error is smaller. 


range difference between the largest 
and smallest measures of a set. 


ratio a scale using numbers to rank 
order; its intervals are equal, and the 
scale has an absolute 0 point. 


region of acceptance the area of a 
probability curve in which a computed 
test statistic will lead to acceptance of 
the null hypothesis. 


region of rejection the area of a prob- 
ability curve in which a computed test 
statistic will lead to rejection of the 
null hypothesis. 


regression a statistical procedure used 
to estimate the linear dependence of 
one or more independent variables on 
a dependent variable. 


relative frequency the ratio of class 
frequency to total number of measures. 


relative frequency principle of prob- 
ability if a random event is repeated a 
large number of times, then the pro- 
portion of times that a particular out- 
come occurs is the probability of that 
outcome occurring in a single event. 


research hypothesis a prediction or 
expectation to be tested. If the null 
hypothesis is rejected, then the 
research hypothesis (also called alter- 
native hypothesis) is supported. 


residual the vertical distance between 
a predicted value y and its actual value. 
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sample a group of members of a popu- 
lation selected to represent that popula- 
tion. A sample to which statistical 
analysis is applied should be randomly 
drawn from the population, to avoid 
bias. 


sampling distribution the distribu- 
tion obtained by computing a statistic 
for a large number of samples drawn 
from the same population. 


sampling variability the tendency of 
the same statistic computed from a 
number of random samples drawn 
from the same population to differ. 


scatter plot a graphic display used to 
illustrate degree of correlation between 
two variables. 


skewed a distribution displaced at one 
end of the scale and a tail strung out at 
the other end. 


slope a measure of a line’s slant. 


standard deviation a measure of data 
variation; the square root of the 
variance. 


standard error a measure of the ran- 
dom variability of a statistic, such as 
the mean (i.e., standard error of the 
mean). The standard error of the mean 
is equal to the standard deviation 
divided by the square root of the sam- 
ple size (7). 


standardize to convert to a zscore. 


statistic a characteristic of a sample. A 
statistic is an estimate of a population 
parameter. For larger samples, the 
statistic is a better estimate of the 
parameter. 
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statistical significance the probability 
of obtaining a given result by chance. 
High statistical significance does not 
necessarily imply importance. 


statistics a branch of mathematics that 
describes and reasons from numerical 
observations; or descriptive measures 
of a sample. 


stem-and-leaf graphic display that 
shows actual scores as well as distribu- 
tion of classes. 


symmetry a shape such that one side 
is the exact mirror image of the other. 


symmetric distribution a probability 
or frequency distribution that has the 
property in which the mean, median, 
and mode are all the same value. 


systematic error the consistent under- 
estimation or overestimation of a true 
value, due to poor sampling technique. 


t-distribution a probability distribu- 
tion often used when the population 
standard deviation is not known or 
when the sample size is small. 


tabled value the value of a computed 
statistic used as a threshold to decide 
whether the null hypothesis will be 
rejected. 


test statistic a computed quantity 
used to decide hypothesis tests. 


two-tailed test a test of the prediction 
that two values are equal, or a test that 
they are not equal. 


Type I error rejecting a null hypothe- 
sis that is, in fact, true. 


Type II error failing to reject a null 
hypothesis that is, in fact, false. 


upper quartile (Q, ), the 75th per- 
centile of a set of measures. 


value a measurement or classification 
of a variable. 


variable an observable characteristic of 
a phenomenon that can be measured 
or classified. 


variance a measure of data variation; 
the mean of the squared deviation scores 
about the means of a distribution. 


z-score a unit of measurement 
obtained by subtracting the mean and 
dividing by the standard deviation. 


A 


addition rule, 41, 115, 145 
alternative hypothesis, 62, 145 
assumptions, regression, 108-109 
average, 24. See also mean 
axes 

frequency distribution, 15 

histograms, 14 

intercept, 102-103 

slope, 103 


bar charts, 8, 9-10, 21, 145 
bell-shaped curves, 16, 145 
bias (systematic errors), 53, 145 
bimodal curves, 17, 145 
binomial events, 145 
binomial probabilities, 118-127, 145 
binomial tables, 47 
binomials 
discrete variables, 46-48 
normal approximation, 58-60 
bivariate relationships, 145 
correlation, 96-102 
box plots, 18-20, 145 
box-and-whiskers, 18-20 


Cc 


Central Limit Theorem, 54, 145 
central tendency. See measures of central 
tendency 
charts 
bar charts, 8, 9-10 
box plots, 18-20 
dot plots, 10 
frequency distributions, 15-17 
frequency polygons, 14-15 
ogive, 11-12 
pie charts, 8, 10-11 
scatter plots, 20-21 
stem-and-leaf, 17-18 


Index 


chi-square (y”), 110-112, 145 
critical values, 134-135 
class frequency, 145 
class intervals, 145 
classes, histograms 
frequency, 13 
intervals, 12 
classic theory of probability, 37-38 
coefficient of determination, 102 
conditional probability, 44 
confidence intervals, 70-72, 145 
populations, 78, 82 
regression, 107-108 
slope, 105-107 
confidence levels, 145 
continuous variables, probability distribution 
and, 46, 145 
correlation, 96-102, 145 
correlation coefficient, 145. See also Pearson's 
product moment coefficient 
critical value, test statistic, 64, 145 
curves 
frequency distribution, 16-17 
frequency distribution, bimodal, 17 
negatively skewed, 147 
positively skewed, 148 
variation and, 31-32 


data, 8, 146 
dependent events, probability of, 39-40, 146 
dependent variables, 146 
descriptive statistics, 146 
deviation from the mean, 146 
percentile, 33 
standard deviation, 31 
variation, 30-31 
df (degrees of freedom), 80, 115, 146 
directional tests, 66-67, 146 
discrete variables, 45-48, 146 
disjoint outcomes, 38, 146 
distributions, 52-53, 146 
dot plot charts, 10, 146 
double counting, probability and, 42-43, 146 


152 CliffsQuickReview Statistics 


emprical rule, 146 
errors 
random, 53 
systematic, 53 
Type I, 150 
Type H, 150 
estimates, hypotheses, 72-73 


failures, binomial variables, 58 
fair coins, probability and, 7 
fixed coins, probability and, 7 
formulae 

central tendency, 25 

shortcuts, variance/deviation, 32-33 
frequency distributions, 15-17, 146 
frequency histogram, 12-13, 146 
frequency polygon, 14-15, 116, 146 
frequency, class frequency, 145 


graphics, pictorial statistics, 3 
graphs, histograms, 8. See also charts 
grouped data, 25-28, 146 
grouped measures, 146 

mean and, 25-26 


histograms, 8, 146 
frequency histogram, 12-13 
relative frequency histograms, 13-14 
horizontal bar charts, 8 
hypotheses 
alternative hypothesis, 62 
confidence intervals, 70-72 
estimating difference score, 72-73 
null hypothesis, 62, 63 
point estimates, 70-72 
research hypothesis, 62-63 
stating, 62-63 
statistical significance, 69-70 
test statistic, 63-66 
Type I errors, 68-69, 150 
Type I errors, 68-69, 150 
univariate tests, 73-75 


independent events, probability of, 39, 146 
independent variables, 146 
inference, 4, 146 
inferential tests, univariate, 76-95 
intercept, 102-103, 146 
interval scale, 34, 147 
IQR (interquartile range), 18, 147 
deviation, 34 
variation, 34 


J 


joint occurrences, probability of, 40-41, 147 
J-shaped curves, 16-17 


L 


least squares, regression, 105, 147 
lower quartile (Q;), 147 

box plots, 18, 19 

deviation, 33-34 


variation, 33-34 


margin of error, 115 
mean, 3, 147. See also deviation from the mean 
binomials, 47 
central tendency, 23, 24 
comparing, #tests, 84-87 
comparing, z-tests, 83-84 
grouped measures, 25-26 
standard error, 54 
symmetric distribution, 28 
measurement scales, 34—35 
measures of central tendency, 21-29, 147 
measures of variation, 29-34, 147 
median value, 115, 147 
box plots, 18, 19 
central tendency, 23, 24-25 
grouped data, 26-28 
outliers and, 24 
symmetric distribution, 28 
middle quartile (Q,), 147 
deviation, 33-34 
variation, 33-34 
mode, 147 
central tendency, 23, 25 
symmetric distribution, 28 


mound-shaped curves, 16, 147 

multiplication rule, joint occurrences and, 
40-41, 115, 147 

mutually exclusive outcomes, 38, 115, 147 


negative relationships, 21, 147 
negatively skewed curves, 147 
nominal scale, 34, 147 
non-directional tests, 66-67, 147 
non-mutually exclusive outcomes, 42 
non-parametric tests, 110-112, 147 
normal approximation to binomials, 58-60 
normal curves, 31-32 

properties, 54-58 

z-score, 56-57 
normal distributions, 31-32, 147 
normal tables, 114 
null hypothesis, 62, 63, 147 


numerical statistics, 3, 148 


ogive, 11-12, 22, 148 
one-sample #test, 79-82 
one-sample z test, 76-79 
one-tailed tests, 66-67, 114, 148 
ordinal scale, 34, 148 
organization, 4 
outliers, 148 
central tendency, 24 
stem-and-leaf, 17 


P 


paired difference test, 87-89 
parameters, 50-51, 115, 148 
Pearson’s product moment coefficient (7), 
98-102, 148 
percentile, 33, 148 
pictorial statistics, 3-4, 148 
pictures. See graphs, histograms 
pie charts, 8, 10-11, 148 
point estimates, hypotheses, 70-72, 148 
pooled variance method, population, 86-87 
population, 5, 50-51, 148 
Central Limit Theorem, 54 
confidence interval, 78, 82 
single population proportion, 89-91 
positive relationships, 21, 148 
positively skewed curves, 29, 148 
power, hypothesis errors, 69, 148 
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predictions, 5-6 
confidence level, 107-108 
probability, 6-7, 37-38, 148 
conditional, 44 
dependent events, 39-40 
double counting, 42-43 
independent events, 39 
joint occurrences, 40-41 
non-mutually exclusive outcomes, 42 
relative frequency principle, 38, 149 
simple events, 38-39 
probability distributions, 44-45, 148 
continuous variables, 46 
discrete variables, 45-46 
process of statistics gathering, 4 
properties of normal curve, 54-58 
proportion, 148 
binomial experiments, 46 
proportional frequency, histograms, 14 


Q 


qualitative data, 15, 148 
quantitative data, 15, 148 
quartiles, 115 


r (Pearson’s product moment), 98-102 
random error, 53, 149 
randomness, 148 
samples, 51 
ranges, 149 
histograms, 12 
variation, 30 
rank order, ordinal scale, 34 
ratio scale, 35, 38, 149 
region of acceptance, 65, 149 
region of rejection, 65, 149 
regression, 102-109, 149 
relative frequency histograms, 13-14, 149 
relative frequency principle of probability, 
38, 149 
research hypothesis, 62-63, 149 
residual, regression, 105, 149 
resources, 142-144 
results comparison, 6 


samples, 5, 50-51, 149 
central limit theorem, 54 


sampling distributions, 52-53, 149 
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sampling variability, 52-53, 149 
scales. See measurement scales 
scatter plots, 20-21, 149 
correlation, 96-97 
single population proportion, 89-91 
skewed curves, 16, 28, 149 
slope, 103, 105-107, 149 
spread, 30. See also measures of variation 
square root, 31 
standard deviation, 31-32, 47, 114, 149 
standard error, 54, 149 
standard normal probabilities, 128-131 
standard normal tables, 114 
standardization, 149 
statistical insurance methods, 4 
statistical significance, hypotheses, 
69-70, 150 
statistics, 50-51, 115, 149, 150 
descriptive statistics, 146 
samples, 51 
types, 3-4 
stem-and-leaf chart, 17-18, 150 
successes, binomial variables, 58 
sum notation, 25 
symmetric distribution, 28, 150 
symmetry, 150 
box plot, 18 
curves, 16 
systematic error, 53, 150 


T 


t-distribution, 79, 150 

critical values, 132-133 
ttests, 79-89, 115 
tabled value, 64, 150 
test statistics, 63-67, 150 
testing, 73-95 
two-tailed tests, 66-67, 114, 150 
Type I errors, 68-69, 150 
Type I errors, 68-69, 150 


univariate tests, 73-75 
upper quartile (Q;), 150 
box plots, 18, 19 
deviation, 34 
variation, 34 


V 


values, 150 
axes, 15 
pairs, interval scale, 34-35 
variables, 150 
variance, 114, 150 
variation, 30-31 
variation. See also measures of variation 
lower quartile (Q,), 33-34 
middle quartile (Q,), 33-34 


vertical bar charts, 9 


Web sites, 143-144 


X 


x axis, 15 


slope, 103 


Y 


yaxis, 15 
slope, 102-103 


Z 


z-score, 150 

normal curves and, 56-57 
&-tests 

comparing two means, 83-84 


one-sample, 76-79 


